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ANALYZING  AVAILABILITY  USING  TRANSFER  FUNCTION 
MODELS  AND  CROSS  SPECTRAL  ANALYSIS* 
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School  of  Engineering  and  Applied  Science 
Institute  for  Management  Science  and  Engineering 
Washington,  D.C. 


ABSTRACT 

If  we  look  at  the  literature  of  reliability  and  life  testing  we  do  not  see  much 
on  the  use  of  the  powerful  methods  of  time  series  analysis.  In  this  paper  we 
show  how  the  methods  of  multivariate  time  series  analysis  can  be  used  in  a 
novel  way  to  investigate  the  interrelationships  between  a  series  of  operating 
(running)  times  and  a  series  of  maintenance  (down)  times  of  a  complex  sys¬ 
tem  Specifically,  we  apply  the  techniques  of  cross  spectral  analysis  to  help  us 
obtain  a  Box-Jenkins  type  transfer  function  model  for  the  running  limes  and 
the  down  times  of  a  nuclear  reactor.  A  knowledge"  of  the  interrelationships 
between  the  running  limes  and  the  down  times  is  useful  for  an  evaluation  of 
maintenance  policies,  for  replacement  policy  decisions,  and  for  evaluating  the 
availability  and  the  readiness  of  complex  systems. 

1.  INTRODUCTION  AND  SUMMARY 

The  investigation  reported  here  was  undertaken  to  determine  if  a  stochastic  interrelation¬ 
ship  exists  between  the  runnng  times  and  the  down  times  of  the  Robinson  Nuclear  Power 
Plant.  The  data  was  provided  to  us  by  the  Probabilistic  Analysis  Staff  of  the  Nuclear  Regulatory 
Commission  (NRC).  Our  goal  was  to  understand  what  the  data  were  telling  us  about  the  rela¬ 
tionship  between  the  series  of  running  times  and  the  series  of  down  times.  One  way  of  achiev¬ 
ing  this  goal  is  to  obtain  a  Box-Jenkins  [1]  type  of  "transfer  function  model”  between  the  run¬ 
ning  times  and  the  down  times.  The  interpretation  and  uses  of  a  transfer  function  model  for 
the  situation  considered  here  are  discussed  in  Section  1.1. 

A  first  step  in  the  analysis  of  the  data  was  its  careful  screening.  This  was  done  in  order  to 
eliminate  those  observations  that  were  judged  to  be  questionable  or  that  had  arisen  under 
unusual  circumstances.  Such  observations  introduce  spurious  autocorrelations  and  cross  corre¬ 
lations,  and  thus  tend  to  obscure  the  identification  of  a  simple  relationship  that  may  exist 
between  the  running  times  and  the  down  times. 

We  would  like  to  emphasize  that,  for  an  analysis  of  data  of  the  type  discussed  here  (often 
referred  to  as  "messy  data"),  an  examination  and  screening  of  the  data  prior  to  model  building 
are  important  preliminary  operations.  If  one  neglects  to  perform  these  operations,  one  may 
face  the  frustrating  task  of  attempting  to  fit  several  transfer  function  models,  none  of  which 
may  be  satisfactory. 

‘Jointly  Sponsored  by  Contract  AT (49-24) -021 1  Nuclear  Regulatory  Commission  and  Program  in  Logistics  Contract 
N000I4-75-C-0729  Project  NR  347  020  Office  of  Naval  Research. 
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In  Figures  1.1  and  1.2,  we  display  a  time  sequence  plot  of  the  screened  down  times  X,  and 

the  corresponding  screened  running  times  Y„  t  -  1,  2 . 28.  Note  that  X\  represents  the 

first  down  time,  X2  the  second  down  time,  and  so  on,  whereas  F|  denotes  the  first  running 
time,  Y2  the  second  running  time,  and  so  on.  Note  also  that  the  two  plots  are  not  drawn  to  the 
same  scale.  In  Figure  1.3  we  indicate  the  relative  positions  of  the  X,'s  and  the  F,’s,  r  -  1,2, 


TIME  IN 
DAYS 


Figure  1.1.  Time  series  plot  of  screened  down  times 


In  transfer  function  model  building  observations  must  be  considered  in  pairs.  In  our  case, 
the  paired  observations  will  be 

U|.  J'l).  (*2.  ^ . <*21.  ^g). 

In  considering  the  above  pairs,  we  will  have  to  bear  in  mind  that  X,  precedes  Y,  in  chronologi¬ 
cal  time. 

In  Table  1.1  we  present  the  actual  values  of  the  screened  down  times  X,  and  the 
corresponding  screened  running  times  Y„  t  -  1,  2,  . . .  ,  28. 

If  changes  in  a  series  of  observations  Y„  t  -  1,  2 . tend  to  be  anticipated  by  changes 

in  another  series  of  observations,  say  t  -  1,  2,  . . .  ,  then  X,  is  said  to  be  a  leading  indicator 
of  Y,.  In  our  case  it  is  reasonable  to  assume  that  the  down  times  X,  are  a  leading  indicator  of 
the  running  times  Y,.  Based  upon  this  consideration,  and  together  with  an  analysis  of  the  avail¬ 
able  data,  albeit  an  insufficient  amount ,  the  best  transfer  function  model  we  have  identified  and 
fitted  is  given  as 
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Figure  1.2.  Time  series  plot  of  screened  running  times 
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TABLE  1.1  —  Screened  Down  Times  and 
Running  Times  of  the  Robinson  Nuclear 
Power  Plant 


Down  Times 
(days):  X, 

Prewhitened 
Down  Times 
(days):  a, 

Runnng  Times 
(days):  Y, 

Prewhitened 
Running  Times 
(days):  /3, 

0.87 

0.07 

22.83 

6.40 

0.26 

-0.54 

15.75 

-1.43 

0.42 

-0.30 

3.50 

-11.55 

0.95 

-0.09 

0.00 

-14.50 

1.13 

0.19 

21.00 

3.88 

0.06 

-0.78 

8.00 

—13.17 

0.19 

-0.47 

11.00 

-5.09 

0.02 

-0.84 

1.00 

-16.82 

0.08 

-0.81 

16.00 

-1.18 

0.06 

-0.86 

101.00 

80.84 

0.40 

-0.55 

0.58 

27.76 

0.01 

-0.99 

1.00 

2.97 

0.15 

-0.75 

0.83 

-10.46 

0.08 

' 

-0.89 

6.90 

-11.44 

0.17 

-0.80 

0.83 

15.85 

5.48 

4.51 

27.00 

8.86 

2.81 

1.21 

44.00 

22.01 

0.06 

—0.13 

8.00 

11.43 

0.08 

0.13 

17.00 

7.25 

0.98 

0.30 

25.00 

8.48 

0.87 

0.04 

17.00 

-0.57 

0.70 

-0.03 

10.30 

-3.64 

0.63 

-0.11 

11.36 

-3.64 

0.19 

-0.59 

2.81 

-13.93 

4.60 

3.85 

63.00 

47.04 

0.19 

-1.19 

8.00 

-17.05 

0.14 

0.13 

3.90 

-3.48 

0.41 

-0.12 

_ 

12.50 

-0.50 

(1.1)  K  -  10.630  -  6.6X,  -  0.55T,_,. 

Interpretation  and  uses  of  the  Transfer  Function  Model 

Transfer  function  models  are  generally  used  to  forecast  the  future  values  of  a  time  series 
V,  (in  our  case  the  running  times)  given  the  previous  values  of  the  leading  indicator  series  X, 
(the  down  times  in  our  case).  However,  forecasts  of  the  running  times  based  upon  the  previ¬ 
ous  and  present  values  of  the  down  times,  via  Equation  (1.1),  must  be  used  with  great  caution 
for  the  following  two  reasons.  First,  the  occurrence  of  unforeseen  but  rare  circumstances  may 
cause  the  future  running  times  to  be  abnormally  small  (or  even  large).  An  example  of  this  is  a 
reactor  shutdown  due  to  an  unforeseen  operator  error.  Second,  since  Equation  (1.1)  is  based 
on  only  28  observations,  it  may  not  be  too  reliable  as  a  model  for  forecasting.  We  can,  how¬ 
ever,  make  several  observations  of  practical  interest  based  upon  Equation  (1.1). 
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We  first  note  that  previous  values  of  K,,  such  as  K,_!,  F,_2 . etc.,  do  not  appear  in 

Equation  (1.1).  This  implies  that  the  running  lime  history  gives  us  little  information  about  the 
individual  future  running  times;  that  is,  the  next  running  time  may  be  unpredictable  from  a 
knowledge  of  the  previous  running  times.  However,  future  running  times  on  the  average  may 
be  estimated  from  prevous  running  times. 

An  important  consequence  of  Equation  (1.1)  is  that  the  running  times  Y,  appear  to  be 
strongly  influenced  by  the  immediately  preceding  down  times  X,.  Since  the  down  times  gen¬ 
erally  correspond  to  maintenance  actions,  we  can  make  the  following  conjecture: 

Conjecture :  Barring  unforeseeable  circumstances,  and  confining  ourselves  to  the  limits  of  the 
observed  data,  the  operating  times  are,  on  the  average,  increased  by  a  factor  of  about  six  per 
unit  increase  in  the  maintenance  times. 

An  explanation  to  support  the  above  conjecture  is  that  the  more  thorough  a  job  of  repair 
that  is  performed,  the  longer  the  next  running  time  becomes.  This  is  perhaps  one  of  the  most 
important  conclusions  that  can  be  reached  from  our  analyses. 

Since  the  coefficient  of  —0.55  associated  with  T,_j  is  small  compared  to  the  coefficient  of 
6.6  associated  with  X„  we  will  ignore  the  efTect  of  T,_,  on  Yt.  Even  though  the  transfer  func¬ 
tion  model  is  obtained  after  an  involved  analysis,  as  discussed  in  the  remainder  of  this  report, 

the  simplicity  of  Equation  (1.1)  suggests  that  a  plot  of  Y,  versus  X,,  t  =  1,2 . 28,  should 

be  approximately  linear.  The  actual  plot  confirms  the  reasonableness  of  Equation  (1.1),  includ¬ 
ing  the  values  of  its  coefficients. 

In  conclusion,  for  the  situation  considered  here  the  transfer  function  model  is  more 
efficient  as  a  tool  that  gives  us  some  insight  into  the  manner  in  which  the  system  operates, 
rather  than  as  a  tool  that  can  give  us  reliable  forecasts  of  future  running  times. 

The  remainder  of  this  paper  is  devoted  to  a  discussion  of  the  pertinent  details  that  lead  us 
to  our  model.  In  Section  2,  by  way  of  presenting  some  aspects  of  transfer  function  model 
building,  we  also  introduce  some  terminology  and  notation.  In  Section  3,  we  present  an 
analysis  of  our  data. 

In  what  follows,  we  require  the  reader  to  have  some  familiarity  with  the  material  in  Box 
and  Jenkins  [1]  and  with  that  in  Jenkins  and  Watts  [3]. 

2.  TRANSFER  FUNCTION  MODELS  AND  THEIR  ESTIMATION 

Univariate  transfer  function  models  as  described  by  Box  and  Jenkins  [1)  are  models  that 
specify  the  stochastic  interrelationships  between  two  time  series.  They  are  more  general  than 
regression  models  with  lag  structures  on  predetermined  variables,  in  that  the  dependent  vari¬ 
able  can  also  have  a  lag  structure.  In  addition,  the  transfer  function  models  can  have  a  super¬ 
imposed  error  structure  which  may  be  of  a  very  general  nature.  Engineers  often  refer  to  error 
with  the  term  "noise,"  and  "white  noise"  refers  to  errors  that  are  independent  and  identically 
distributed. 

There  are  two  equivalent  representations  of  a  univariate  transfer  function  model.  One  is 
the  infinite  or  reduced  form,  and  the  other  is  the  finite  form.  In  the  infinite  form,  the  output 
series,  say  Y,  (in  our  case  the  running  time),  is  explicitly  represented  as  a  function  of  the  input 
series  X,  (in  our  case  the  down  time)  and  its  lagged  (previous)  values;  that  is. 
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(2.1)  Y,  —  v0  +  V!  X,.\  +  v2  AT,_2  +  . . .  +  N,  , 

where  the  constants  v0,  vt,  ...  are  called  the  impulse  response  weights.  In  cases  where  there  is 

no  immediate  response,  one  or  more  of  the  initial  v’s,  say  v0,  . . .  is  equal  to  zero. 

The  process  /V,  represents  noise,  which  is  assumed  to  be  independent  of  the  level  of  the  input 
series,  but  is  additive  with  respect  to  the  influence  of  the  input;  N,  can  have  any  general  struc¬ 
ture. 

It  can  be  shown  that  an  equivalent  representation  of  the  model  given  by  Equation  (2.1)  is 
the  following  finite  form: 

(2.2)  Y,  —  8|  K,_|  —  ...  —  8,  K,_,  —  o)0  A",-*  —  <*>i  —  ...  —  u>s  +  N,  . 

where  the  8’s,  the  a<’s,  and  h  are  unknown  constants.  The  constant  b  associated  with  the  lead¬ 
ing  indicator  series  X ,  indicates  which  of  the  previous  values  of  X,  affect  the  present  Y,.  In  our 
application,  the  value  of  b  represents  the  number  of  previous  maintenance  times  affecting  the 
present  running  time. 

A  first  step  towards  estimating  the  transfer  function  model  is  a  tentative  identification  of 
the  values  of  r ,  b.  and  s.  This  can  be  accomplished  by  an  examination  of  the  estimated  impulse 

response  weights  vA,  k  **  0,  1.  2 .  A  plot  of  vA  versus  k  is  known  as  the  impulse  response 

/unction. 

There  are  two  general  approaches  for  obtaining  the  impulse  response  function.  The  first 
one,  outlined  by  U,  p.  3791,  is  based  on  a  "prewhitening"  of  the  input  series.  Prewhitening  the 
input  series  means  fitting  a  time  series  model  to  the  X,  series  such  that  the  residuals  from  the 
model,  say  <«,,  are  independent  and  identically  distributed  random  variables  with  mean  zero  and 
a  constant  variance.  When  the  vA  are  estimated  using  the  prewhitening  of  the  input  series  pro¬ 
cedure,  their  neighboring  values  tend  to  be  correlated.  Thus  the  graph  of  the  impulse  response 
function  tends  to  be  misleading.  This  ultimately  affects  our  ability  to  obtain  a  realistic  transfer 
function  model.  We  are  therefore  interested  in  considering  an  alternate  approach  for  estimating 
the  impulse  response  weights. 

The  second  approach  for  estimating  the  impulse  response  weights  involves  the  use  of 
"cross  spectral  analysis".  Such  an  approach  removes  the  difficulties  associated  with  the  problem 
of  the  correlated  estimates  of  vA,  and  also  provides  us  with  some  additional  insight  into  the 
nature  of  the  dependencies  between  the  input  and  output  series.  These  are  illustrated  at  the 
end  of  Section  3. 

Once  the  impulse  response  function  is  obtained,  we  can  isolate  the  noise  series,  N,.  by 
using  Equation  (2.1).  Specifically,  we  estimate  the  noise  series  by 

(2.3)  N,  -  Y,  -v0-v,  X,  _,-v2  X, 

A  knowledge  of  N,.  plus  a  knowledge  of  the  tentative  values  of  r,  b ,  and  s,  helps  us  to 
estimate  the  parameters  of  the  transfer  function  model.  Equation  (2.2).  One  way  of  accom¬ 
plishing  this  is  by  using  the  TIMES  program  package  described  by  Willie  17], 

The  adequacy  of  the  proposed  model  can  be  checked  by  an  analysis  of  the  residues  from 
the  model.  The  details  of  such  an  analysis  are  given  in  [1,  p.  392). 
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3.  ANALYSIS  OF  RUNNING  TIME  AND  DOWN  TIME  DATA 

Data  on  ihe  operating  history  of  nuclear  reactors  are  generally  available  showing  dates  on 
which  the  reactors  ceased  operation  and  the  duration  of  the  stoppage.  Among  other  facts,  the 
reasons  for  the  stoppage  are  also  given.  Stoppages  are  categorized  according  to  whether  they 
were  scheduled  or  forced.  In  addition  to  this,  there  is  a  further  breakdown  indicating  whether 
the  stoppage  was  due  to  equipment  failure,  testing,  refueling,  regulatory  reasons,  operator  train¬ 
ing,  administrative  reasons,  operational  error,  or  other  causes. 

3.1  Screening  the  Data 

The  data  that  were  given  to  us  described  the  Robinson  Power  Plant’s  operating  history 
from  June  1974  through  April  1976.  These  data  did  not  contain  any  stoppages  due  to  adminis¬ 
trative  reasons,  operational  error,  or  other  causes;  they  contained  one  stoppage  for  regulatory 
reasons  and  one  stoppage  for  refueling.  In  one  instance,  the  data  contained  an  unrealistic 
combination  for  the  cause  of  stoppage— a  scheduled  failure.  In  this  case  we  used  our  discretion 
to  alter  it  to  a  forced  failure.  Whenever  there  were  stoppages  due  to  operator  training,  these 
were  treated  as  running  times  rather  than  as  down  times.  This  was  done  for  two  reasons.  First, 
the  duration  of  each  stoppage  was  very  short  (on  the  average  about  1/2  hour);  second,  we 
would  like  to  concentrate  on  those  down  times  that  pertain  to  the  physical  operation  of  the  sys¬ 
tem  rather  than  on  those  external  to  it. 

The  single  stoppage  due  to  a  forced  regulatory  restriction  was  for  a  period  of  3.67  hours, 
and  since  it  immediately  followed  a  forced  equipment  failure  of  15.52  hours,  it  was  combined 
with  the  equipment  failure  stoppage. 

Refueling  the  reactor  takes  place  annually  and  is  generally  of  a  very  long  duration.  In  our 
data,  we  had  only  one  stoppage  for  scheduled  refueling,  and  it  was  of  960  hours  duration. 
Since  the  duration  of  this  stoppage  is  out  of  line  with  the  duration  of  the  other  stoppages  (see 
Table  1.1),  it  was  excluded  from  consideration. 

We  remark  here  that  any  time  a  stoppage  (running  time),  say  X,(Y,),  was  excluded  from 
consideration,  its  corresponding  running  time  (stoppage)  Y,  (X,)  was  also  excluded.  This  is  to 
ensure  that  no  bias  is  introduced  into  the  relationship  between  the  two  variables  of  interest 
because  of  the  elimination  of  observations  of  either  one. 

Another  convention  followed  in  our  analysis  arises  from  the  fact  that  the  raw  data  show 
the  dates  and  the  duration  (in  hours)  of  each  stoppage,  rather  than  the  actual  time  of  stoppage. 
We  assume  that  each  down  time  commenced  at  0000  hours  (unless  in  some  rare  instances  there 
is  a  second  stoppage  occurring  during  the  same  day).  Whenever  two  or  more  breakdowns 
occurred  during  the  same  day,  they  were  combined  into  one  down  time  period  and  the  inter¬ 
vening  operating  period  was  ignored. 

Because  of  the  paucity  of  data,  we  chose  not  to  distinguish  between  stoppages  due  to 
equipment  failure  and  those  due  to  testing.  This  is  reasonable  because  whenever  there  is  a 
forced  equipment  faiure,  maintenance  and  test  actions  on  other  (nonfailed)  components  are 
routine.  Thus,  in  practice  it  is  difficult  to  differentiate  clearly  between  the  consequences  of 
equipment  failure  and  those  of  testing.  This  strategy  was  suggested  by  some  staff  members  of 
the  Probabilistic  Analysis  Staff  at  NRC. 

The  preliminary  screening  and  examination  described  above  gives  us  a  series  of  values  of 
the  down  times  X,  (in  days)  and  a  series  of  values  of  the  corresponding  running  times  )',  (in 
days).  However,  as  we  shall  soon  see,  some  further  screening  is  necessary. 
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The  next  step  in  our  analysis  involved  prewhitening  the  X,  series.  This  turned  out  to  be 
quite  a  frustrating  endeavor,  since  no  simple  univariate  time  series  model  of  the  Box-Jenkins 
type  seemed  to  provide  a  reasonable  fit.  The  difficulty  turned  out  to  have  been  caused  by  two 
unusually  large  down  times  due  to  scheduled  testing  and  forced  failure  of  18.08  and  25.43  days, 
respectively.  These  were  incompatible  with  the  other  down  times  (see  Table  1.1),  and  thus 
defied  the  use  of  a  simple  model  as  a  prewhitening  transformation.  Perhaps  a  time  series 
model  with  an  indicator  variable  (such  as  those  used  in  the  "intervention  analysis"  of  Box  and 
Tiao  (21)  might  have  been  adequate  for  these  and  for  the  refueling  stoppage,  but  this  was  not 
attempted.  In  the  interest  of  expediency,  it  was  preferable  to  eliminate  the  two  large  X,'s  and 
their  corresponding  Y,' s.  Thus,  in  effect,  some  data  screening  was  done  during  the  prewhiten¬ 
ing  phase.  Table  1.1  presents  the  28  screened  values  of  the  down  times  X,  and  the  correspond¬ 
ing  running  times  Y,.  We  remind  the  reader  that  the  subscript  t  is  a  sequential  index  rather 
than  an  index  representing  time.  That  is,  X,  and  Y,  are  not  simultaneously  observed  in  time; 
X,  precedes  Y,. 

3.2  Transfer  Function  Model  Development 

In  Figures  3.1  and  3.2  we  show  plots  of  the  autocorrelation  functions  of  the  (screened) 
down  times  X,  and  the  (screened)  running  times  Y,,  respectively.  Based  upon  these  plots  we 
are  able  to  conclude  that  the  two  time  series  can  be  treated  as  stationary  [1,  p.  174], 
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Figure  3  1  Esiimaicd  aulocorrelalions 
of  down  limes  V, 


In  Figure  3.3  we  show  a  plot  of  the  logarithm  of  the  smoothed  estimator  of  the  power 
spectrum  of  the  screened  down  times  Xn  as  well  as  the  approximate  95%  confidence  limits. 
The  smoothing  was  performed  using  a  rectangular  window  of  band  width  .098.  As  a  matter  of 
fact,  all  the  smoothing  that  is  discussed  in  this  paper  was  performed  using  a  rectangular  window 
of  band  width  .098.  The  power  spectrum  curve  shows  us  how  the  variance  of  the  X,  time  series 
is  distributed  with  frequency.  For  a  detailed  understanding  of  the  power  spectrum,  its  smooth¬ 
ing,  and  the  band  width  of  a  smoothing  window,  we  refer  the  reader  to  Chapter  6  of  Jenkins 
and  Watts  [31. 


SPECTRUM 


Figure  3.3  Logln  of  the  smoothed  estimator  of  the  spectrum  of  down  limes  versus  frequency,  using 
rectangular  window  of  band  width  0.98 
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Our  next  step  involves  the  determination  of  a  suitable  prewhitening  transformation  for 
the  down  times  X,.  Based  upon  the  several  models  that  we  attempted,  we  conclude  that  a  mov¬ 
ing  average  process  of  order  3  best  describes  the  X,  series.  Specifically,  we  find  that 

(3.1)  X,  -  0.7963  -  a,  +  0.117a, -  0.189a, _2  -  0.133a„j, 

where  7963  is  the  mean  of  the  X.  series.  The  a,  represent  the  residuals  when  a  moving  aver¬ 
age  process  of  order  3  is  fitted  to  the  X,  series.  If  the  prewhitening  transformation  given  by 
Equation  (3.1)  is  correct,  then  the  a,  will  be  independently  and  identically  distributed  with  a 
constant  mean  and  variance. 

In  Table  1.1  we  give  the  values  of  the  a,'s.  In  order  to  verify  the  appropriateness  of  the 
model  given  by  Equation  (3.1),  we  plot  the  estimated  autocorrelation  function  and  the 
estimated  power  spectrum  of  the  a,  series.  These  plots  are  given  in  Figures  3.4  and  3.5,  respec¬ 
tively.  We  remark  that  the  plot  of  the  estimated  power  spectrum  of  the  a,  series  given  in  Fig¬ 
ure  3.5  is  relatively  constant  as  compared  to  the  plot  of  the  estimated  power  spectrum  of  the  X, 
series  given  in  Figure  3  3.  This  is  because  the  effect  of  prewhitening  is  to  remove  the  depen¬ 
dencies  among  the  ,V,'s  and  give  us  a  set  of  independent  a,.  Figures  3.4  and  3.5  confirm  the 
appropriateness  of  the  prewhitening  transformation  given  by  Equation  (3.1). 
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Figure  3  4  Estimated  autocorrelations  of  residuals 
from  the  model  used  for  prewhitening  V, 


Following  Box  and  Jenkins  [1,  p.  380],  we  next  apply  the  same  prewhitening  transforma¬ 
tion  (i.e.,  the  one  given  by  Equation  (3.1))  to  the  running  times  Y,  and  obtain  the  /3,'s  as  resi¬ 
duals.  In  Table  1.1  we  give  the  values  of  0,  under  the  heading  "Prewhitened  Running  Times." 
In  Figure  3.6  we  show  a  plot  of  the  smoothed  estimator  of  the  power  spectrum  of  /8,.  We 
remark  that  except  at  the  very  low  frequencies,  the  plot  of  the  estimated  power  spectrum  of  the 
£,'s  is  fairly  constant.  Thus  it  appears  that  the  prewhitening  transformation  given  by  Equation 
(3.1),  when  applied  to  the  running  times  T„  also  yields  a  sequence  of  independent  and  identi¬ 
cally  distributed  random  variables  0,. 
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Figure  3.5.  Log10  of  i he  smoothed  estimator  of  ihe  spectrum  of  the  prewhiiened  down  limes  versus  frequency. 

using  rectangular  window  of  band  width  098 
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Figure  3.6.  Log,()  of  the  smoothed  estimator  of  the  spectrum  of  H,  versus  frequency,  using  a 
rectangular  window  of  band  width  .098 
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Our  next  step  is  to  obtain  the  cross  correlation  between  the  a,  and  /3,  at  lags  k,  k  —  0,  1, 

2 .  If  s„  and  denote  the  estimated  standard  deviations  of  the  a,  and  the  /3,  series, 

respectively,  and  if  ritfS(k)  denotes  the  estimated  cross  correlation  between  the  a ,  and  the  /3,  at 
lag  A,  then  vA,  an  estimate  of  vA,  is 

vA  -  raP(k)  .  k  -  0,  1,  2 . 

$01 

(see  Box  and  Jenkins  [1,  p.  380]). 

In  Figure  3.7  we  show  a  plot  of  the  impulse  response  function;  that  is,  a  plot  of  vA  versus 
A,  A  -  0,  1,  2,  ...  .  We  remark  that  in  Figure  3.7,  the  value  v0  is  significantly  larger  than  the 
other  values  of  vA,  k  -  1,  2,  ...  . 
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Figure  3.7.  The  impulse  response  function  using 
the  prewhiiening  of  Hie  inpui  series 


Because  the  neighboring  values  of  the  vA  in  Figure  3.7  tend  to  be  correlated,  we  also 
obtained  the  impulse  response  function  using  the  "cross  spectrum"  between  the  X,  and  the  Y, 
series  (see  Jenkins  and  Walts  [3,  p.  424]).  In  Figure  3.8  we  show  a  plot  of  the  impulse 
response  function  using  the  cross  spectrum.  Note  that  this  plot  is  quite  similar  to  the  one  given 
in  Figure  3.7;  that  is,  v0  is  significantly  larger  than  the  other  values  of  vA.  Based  upon  Figures 
3.7  and  3.8,  we  can  conclude  that  (he  greatest  influence  on  the  running  lime  is  exerted  by  the  down 
lime  immediately  preceding  ii.  This  is,  of  course,  a  major  point  of  our  conclusions. 

We  now  estimate  the  noise  series  /V,  using  Equation  (2.3).  An  inspection  of  the 
estimated  autocorrelation  function  of  the  estimated  noise  series  A,,  and  a  "portmanteau  lack  of 
fit  test"  (1.69  with  8  degrees  of  freedom)  (1,  p.  290],  lead  us  to  conclude  that  the  N,  are 
independent  and  identically  distributed.  In  addition  to  the  above,  we  show  in  Figure  3.9  a  plot 
of  the  smoothed  estimator  of  the  power  spectrum  of  the  N,.  Here  again,  except  at  the  very  low 
frequencies,  the  estimated  power  spectrum  of  the  N,  series  is  fairly  constant.  Thus  the 
estimated  noise  series  N,  can  be  described  reasonably  well  by  a  white  noise  process. 

A  final  step  in  the  analysis  involves  the  fitting  of  a  transfer  function  model  to  the  running 
times  Y,.  This  was  accomplished  by  using  the  TIMES  program  package.  Of  the  several  models 
that  were  attempted,  the  model 
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(3.2)  Y,  -  10.630  -  6.6*,  -  0.55*, 

appears  to  be  the  best;  10.630  is  the  mean  of  the  Y,  series. 

In  order  to  verify  the  reasonableness  of  the  model,  two  diagnostic  checks  were  suggested 
in  [11.  One  depends  on  the  autocorrelation  of  the  A',  and  the  other  depends  on  the  estimated 
cross  correlation  between  the  jV,  and  the  a,.  For  both  cases  a  portmanteau  lack  of  fit  test  was 
used.  In  the  former  case,  the  test  statistic  is  1.69  with  8  degrees  of  freedom,  whereas  in  the 
latter  case  the  test  statistic  is  .698  with  6  degrees  of  freedom.  These  values  support  the  reason¬ 
ableness  of  the  proposed  transfer  function  model. 

As  stated  earlier,  a  cross  spectral  analysis  of  the  *,  and  the  Y,  series  can  give  us  further 
insight  into  the  nature  of  the  dependencies  between  the  two  series.  For  example,  </(/),  the 
"gain"  at  frequency  /,  behaves  like  the  regression  coefficient  in  a  linear  regression  model 
between  the  output  and  the  input  at  frequency  /  (see  Jenkins  and  Watts  [3,  p.  3521.  In  Figure 
3.10  we  plot  the  gain  of  the  running  times  on  the  down  times  at  various  frequencies.  Another 
important  function  in  cross  spectral  analysis  is  the  "squared  coherence"  between  the  input  and 
the  output  at  frequency  /.  This  quantity  measures  the  correlation  between  the  sinusoidal  com¬ 
ponent  of  Y,  and  that  of  *,  at  frequency  f  The  square  coherence  is  also  in  some  sense  a  meas¬ 
ure  of  the  proportion  of  information  in  the  Y,  series  that  is  attributable  to  the  *,  series.  For 
more  information  on  the  coherence  and  the  coherence  spectrum,  we  refer  the  reader  to  Jenkins 
and  Watts  [3,  p.  3521.  In  Figure  3.11  we  show  a  plot  of  the  coherence  for  the  running  time  and 
down  time  data.  Figures  3.10  and  3.1 1  give  us  some  additional  assurance  on  the  dependence  of 
the  running  times  on  the  down  times. 
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Figure  3.10.  The  gain  versus  frequency  of  running  times  on  down  times 
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Figure  3.11.  The  coherence  versus  frequency  of  running  limes  and  down  limes 
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4.  SUMMARY  AND  CONCLUSIONS 


In  the  foregoing  analysis  we  have  demonstrated  the  use  of  time  series  analysis  methodol¬ 
ogy  for  studying  the  interrelationships  between  the  maintenance  times  and  the  running  times  of 
a  complex  system.  Our  analysis  enables  a  decision  maker  to  assess  the  impact  of  his  mainte¬ 
nance  policies  on  running  times,  or  to  influence  the  operating  times  by  managing  the  mainte¬ 
nance  times.  In  addition,  given  a  down  time  a  decision  maker  can,  to  a  limited  extent ,  forecast 
the  next  running  time.  This  type  of  information  may  be  very  valuable,  especially  for  large  and 
complex  systems. 

Our  analysis  can  be  criticized  on  the  grounds  that  it  is  based  on  an  insufficient  amount  of 
data.  We  hope  that  this  criticism  can  be  overlooked  in  the  light  of  the  fact  that  our  approach  is 
to  be  viewed  as  a  prototype  for  the  analysis  of  reliability  data  involving  two  interrelated  sources 
of  data.  For  other  uses  of  the  time  series  analysis  methodology  for  reliability  and  life  data,  we 
refer  the  reader  to  Singpurwalla  [4],  (5]  and  [6], 
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ABSTRACT 

We  consider  a  model  with  M  +  ,V  identical  machines.  As  many  as  V  of 
these  can  be  working  at  any  given  time  and  the  others  act  as  standby  spares. 

Working  machines  fail  at  exponential  rale  K,  spares  fail  at  exponential  rate  y. 
and  failed  machines  are  repaired  at  exponential  rale  u  The  control  variables 
arc  A.  u,  and  the  number  of  removable  repairman.  .V.  to  be  operated  at  any 
given  time  Using  the  criterion  of  total  expected  discounted  cost,  we  show  that 
A.  V  and  u  are  monolonic  functions  of  the  number  of  failed  machines  \t.  V. 
the  discount  factor,  and  for  the  finite  lime  horizon  model,  the  amount  of  lime 
remaining 

1.  INTRODUCTION 

In  this  paper  we  examine  the  structure  of  the  optimal  maintenance  and  repair  policies  for 
an  exponential  repair  model.  We  assume  that  there  are  M  +  N  identical  machines,  N  of  which 
can  be  working  at  any  given  time.  The  others  function  as  standby  spares.  The  working 
machines  fail  at  rate  A,  and  the  failed  machines  are  repaired  at  rate  /u-  We  also  include  the  pos¬ 
sibility  of  spare  machines  failing  at  rate  y.  Our  control  variables  are  A,  which  can  be  thought  of 
as  a  maintenance  control  variable,  /a,  and  the  number  of  removable  repairmen,  5.  Our  princi¬ 
ple  results  are  that  A  decreases  and  S  and  pt  increase  as  more  machines  are  in  the  failed  state. 
(Throughout  this  paper,  we  use  the  terms  increasing  and  decreasing  to  mean  nondecreasing  and 
nonincreasing,  respectively.)  We  also  show  how  these  control  variables  behave  as  functions  of 
M,  /V,  the  discount  factor,  and  the  amount  of  time  remaining. 

Our  basic  model  has  been  studied  by  several  researchers.  Some  of  these  have  described 
the  model  and  given  its  operating  characteristics  under  exponential  and  nonexponential  assump¬ 
tions.  See,  for  example,  Barlow  [3],  Gnedenko,  Belyayev,  and  Solovyev  [8),  and  Iglehart  and 
Lemoine  [11],  [12].  There  has  been  less  literature  on  the  control  of  these  models.  Goheen  [9] 
and  Cinlar  [4]  study  the  problem  of  where  to  send  failed  machines  when  there  are  several 
repair  facilities.  Cinlar  assumes  exponential  times  and  reduces  the  problem  to  a  linear  program¬ 
ming  problem,  whereas  Goheen  assumes  Erlang  times  and  reduces  to  the  problem  to  a 
mathematical  programming  problem. 

More  in  line  with  the  model  presented  in  the  current  paper  are  the  papers  by  Crabill  [5] 
and  Winston  [18].  Each  considers  a  single-repairman  system  (and  Winston  assumes  no  spares) 
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which  can  be  operated  at  a  finite  number  of  service  rates,  and  each  shows  that  the  optimal  ser¬ 
vice  rate  is  an  increasing  function  of  the  number  of  failed  machines.  Crabill  does  this  with  a 
long  run  average  cost  criterion  by  means  of  tedious  manipulations  of  policy  improvement  equa¬ 
tions.  Winston  uses  a  discounted  cost  criterion,  and  he  builds  up  his  infinite  horizon  model  as 
a  sequence  of  discrete  lime  //-period  problems.  He  then  analyzes  these  quite  easily  be  means  of 
induction  on  //.  This  type  of  approach  is  also  used  in  the  present  paper  although  our  //-period 
model  is  defined  differently  than  Winston’s.  This  approach  allows  us  to  obtain  more  results, 
and  to  do  so  by  simpler  means,  than  was  evidently  possible  for  Crabill. 

2.  THE  BASIC  MODEL 

In  this  section  we  present  the  basic  model  and  main  results.  The  system  we  will  study  is 
composed  of  N  +  M  identical  machines.  As  many  as  N  of  these  can  be  working  simultaneously 
in  parallel,  while  the  rest  function  as  warm  standby  spares.  Each  of  the  machines  which  is 
currently  working  fails,  independently  of  the  others,  with  exponential  rate  X.  Whenever  one  of 
these  machines  fails,  it  is  immediately  replaced  by  a  spare  if  any  is  available.  We  also  assume 
each  of  the  available  spares  fails,  independently  of  the  other  machines,  with  exponential  rate  y. 
(This  is  the  rationale  behind  the  term  "warm  standby”.)  There  is  a  repair  facility  with  as  many 
as  N  +  M  removable  servers  in  parallel.  Whenever  a  working  machine  or  a  spare  fails,  it  is 
sent  to  this  facility,  where  it  is  served  on  a  first-come-first-served  basis  with  exponential  rate  fx. 

The  costs  and  rewards  are  as  follows.  Whenever  j  machines,  0  <  j  <  /V,  are  working, 
revenue  is  earned  at  a  rate  //(./ ),  where  //  is  an  increasing  function  with  //( 0)  =  0.  The  service 
cost  for  having  S  servers  each  working  at  rate  m  is  S/ifx),  where  /is  a  nonnegative  increasing 
function  of  m  Also,  because  we  will  wish  to  control  X,  we  assume  the  cost  of  having  / 
machines  working,  each  subject  to  failure  rate  X,  is  ./x'(X).  Here  a  is  a  decreasing  function  of  X 
and  can  be  thought  of  as  a  maintenance  cost.  Finally,  we  will  continuously  discount  all  reve¬ 
nues  and  costs  by  a  discount  rate  <«  >  0.  Our  objective  is  to  maximize  expected  discounted 
rewards  over  an  infinite  horizon.  The  decision  variables  at  any  decision  point  are  the  number 
of  servers  5,  the  service  rate  per  service  /u,  and  the  failure  rate  per  working  machine  X.  These 
must  satisfy  0<S<A(+A/,  0<  Xmin  <  X  <  Xmax  <  °°,  and  0  <  ^min  <  M  <  Mmax  <  °°- 
We  assume  the  failure  rate  for  spares,  y,  is  a  fixed  constant. 

The  problem  is  formulated  most  naturally  as  a  continuous  time  Markov  decision  model. 
The  state  space  is  |0,  1,  2,  . . .  ,  N  +  M },  where  state  /  means  that  /  machines  are  currently  in 
repair  or  waiting  for  repair.  The  actions,  transition  rates,  and  reward  rates  are  easily  obtained 
from  the  above  description  of  the  model.  For  example,  in  slate  /,  let  N,  and  M,  be,  respec¬ 
tively,  the  number  of  working  machines  and  the  number  of  available  spares.  Then  N,  = 
(/V  +  M  —  /)  A  N  and  M,  =  0  V  (M  —  /),  where  "A"  and  "V"  stand  for  "min"  and  "max".  This 
implies  that  if  we  are  in  state  /  and  use  actions  S,  X,  and  fi,  the  transition  rates  q,,(S.  X,  fx)  are 
</,,  +  i (5.  X,  ft)  =  /V,X  +  A/,y,  X.  m)  =  (S  A  i)n,  and  Q„(,S.  X,  /x)  = 

~/V,X  -  M,y  -  (5  A  i)fx.  The  reward  rate  in  state  /  is  r,(S,  X,  fx)  =  //(/V,)  -  <V,k(X)  - 
Sf(fx).  Because  of  the  exponential  assumptions  and  infinite  time  horizon,  we  may  assume  that 
decisions  are  made  only  at  failure  times  and  at  service  completion  times. 

Let  V(i)  be  the  maximal  expected  discounted  reward  which  can  be  obtained  when  starting 
in  state  i.  Then  it  follows  from  well-known  results  (see,  for  example,  Howard  1 1 0)  or  Ross 
(16])  that  F(/)  satisfies  a  certain  functional  equation  and  that  the  solution  to  this  equation 
yields  the  optimal  actions  to  use  in  state  /.  In  this  paper,  however,  we  are  interested  in  struc¬ 
tural  properties  of  the  optimal  actions  and,  in  particular,  how  they  behave  as  functions  of  i.  To 
discover  these  properties,  the  above  functional  equation  is  unfortunately  not  very  useful. 
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Instead,  it  is  belter  to  build  up  our  infinite  horizon  problem  as  a  sequence  of  ^-period  discrete 
time  problems.  This  enables  us  to  use  induction  arguments  (on  ti )  to  prove  the  desired  struc¬ 
tural  properties  of  the  optimal  actions. 

To  this  end,  we  define  a  finite  horizon  discrete  lime  Markov  decision  model  which  in  the 
limit  (on  the  number  of  periods)  is  equivalent  to  our  original  problem  in  the  sense  that  each 
gives  the  same  optimal  expected  rewards  and  each  prescribes  the  same  optimal  actions.  This 
method  of  using  a  discrete  time  problem  to  prove  structural  properties  of  a  continuous  time 
problem  was  first  utilized  by  Lippman  [14],  and  has  since  been  discussed  and  used  by  several 
other  authors  [1],  [7],  [13],  [17].  An  especially  lucid  description  of  the  relationship  between 
the  two  problems  may  be  found  in  Anderson’s  thesis  [2],  Because  the  method  is  by  now  quite 
well-known,  we  will  only  sketch  the  procedure.  The  reader  is  referred  to  the  above  references 
for  further  details. 

Let  A  =  ;VXmax  +  My  +  (N  +  M)pmix,  so  that  A  >  A,  p)  for  all  states  /  and 

actions  5,  A ,  m  The  possible  states  and  actions  of  the  discrete  time  process  are  the  same  as  for 
the  continuous  time  process,  but  the  transition  probabilities,  rewards,  and  discount  factor  are 
modified  as  follows.  The  transition  probabilities  are  piJ+,  (S,  A,  p)  =  <7,,/+|(S,  A,  p)/A  = 
(iV,A  +  A/,y)/A,  p,,_|(S,  A.  p)  =  q,j-\(S,  A,  p)/A  =  (S  A  /'V./A,  and  p„(S,  A,  p)  = 
(A  +  A,  p))/A  =  (A  -  Nj A  -  M:y  -  (5  A  i)p)/A.  The  one-period  rewards  are 

7,(5,  A,  p)  =  r,(S,  A,  p)/ ( A  +  a)  =  ( h(N ,)  -  N/gM  -  Sf(p))/( A  +  a),  and  the  discount 
factor  is /3  =  A/ (A  +  a). 

Let  y„(i)  be  the  maximal  expected  discounted  reward  over  n  periods  for  the  above 
discrete  time  process.  Then  it  follows  from  the  previously  referenced  results  that  V„(i)  con¬ 
verges  to  V(i)  and  the  optimal  actions  for  the  n-period  problem  converge  to  the  optimal  actions 
for  the  continuous  time  infinite  horizon  problem.  Furthermore,  it  is  easy  to  see  that  V„(i) 
satisfies  the  following  recursive  equation: 

y„(i)  =  max  [J„(i,  S.  A,  /x) }/( A  -I- a), 

S.  K.  M 

where  F0(/)  =  0,  and 

J„0.  S.  A,  p)  =  r,(S.  A.  p)  +  (A>  +  M,y)  +  1) 

+  (5  A  i)p  y„-\(i  -  1)  +  (A  -  N, A  -  M,y  -  (5  A  i)p)  Fn_,(/). 

By  letting  A „(/)  =  F„(/)  -  y„(i  -  1),  we  may  rewrite  J„  in  the  more  useful  form: 

■/,,(/,  5,  A.  p)  =  // ( iV, )  +  N,(—g(K)  +  AA„_,(/  +  D)  +  A/,yA„_i(/  -I-  1) 

-  5/(m)  -  (5  A  i)pA„_i(i)  +  A 

Since  f(p )  >  0,  it  makes  no  sense  to  have  5  >  /.  Therefore,  S  A  i  =  S  and  the  possible 
action  space  in  state  /  is  0  <  S  <  /',  Amin  <  A  <  Amav  /u-min  <  M  ^  Mmax- 

We  will  assume  sufficient  continuity  conditions  on  /  and  g  to  insure  that  the  above  max¬ 
imum  is  obtained.  With  this  in  mind,  let  S„(/)  be  optimal  in  state  /,  let  A„(/)  be  the  largest 
optimizing  action  in  state  /,  and  let  p„(i)  be  the  smallest  optimizing  action  in  state  /,  all  when  n 
periods  remain.  More  concisely,  we  will  say  these  actions  are  optimal  for  (n,  /).  Our  main 
result  below  is  that  S„(i)  and  p„(i)  are  increasing  functions  of  /  and  A „ ( / )  is  a  decreasing  func¬ 
tion  of  /.  We  also  show  how  these  optimal  actions  behave  as  functions  of  n,  «,  M,  and  N.  Our 
first  observation  follows  easily  (and  we  immediately  see  the  benefit  of  this  particular  discrete 
time  formulation).  From  the  above  form  of  J„  it  is  clear  that 

(i)  A„(/)  maximizes  -£(A)  +  AA„_i(/  +  1), 


\ 
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(ii)  n„(i)  minimizes /(m)  +  mA„_|(/), 

0  if  /(m„</)>  ^  0 

(iii)  s„(i)  =  ,  jf  /(Ms(/))  +#1(i(/)An_1(,)  <  o. 

We  are  now  ready  for  the  results  of  this  section. 

PROPOSITION  1:  If  h(H)  -  /<(jV+i)  S*  *(Am,n)  for  /  ^  Af.  then  A„</)  <  0  for  n  >  0, 
1  <  /  <  M  +  N. 


PROOF:  We  use  induction  on  n.  For  n  =*  0,  Ao(')  =  0,  so  the  result  is  trivial.  Assum¬ 
ing  X -i(')  ^  0.  lei  (5|.  A | ,  ^|)  be  the  optimal  actions  for  (n,  /)  and  (Si.  A2.  n:)  be  the 
optimal  actions  for  (n.  /  —  1),  and  we  have 

(A  +  <*)A„(/)  =  J„(i.  S,.  A,.  n\)  -  J„(i  -  1,  S2.  Ai.  M:) 

<  J„(i.  S,.  A,.  nx)  -  J„(i  -  1.  S2.  A,. 

=  h(N,)  -  /i(N,_,)  -  (N,  -  N,_,)k( A,)  +  (/V.A,  +  Afy)A„_,(/  +  1) 

—  S|/(f*|)  +  S2(./(m  2)  +  #i2A„_i(/  —  D) 

+  (A  —  A/,-|A|  —  A/,-iy  —  S,m  1 ) A„_| ( / ) 

<  h (IV)  -  hi N,.t)  -  (.v,  -  #,_,)*( A,)  +  S2(/(m2)  +  -  D). 

by  the  induction  hypothesis  and  definition  of  A.  Also,  if  /  <  -A/,  //(A',)  —  li(N,.t)  — 

(.V,  -  .y_,)g(A,)  =  0.  and  if  i  >  M  +  1,  li(N,)  -  h(N -  (N,  -  A,)  =  //(A',)  - 

//(.V_|)  +  .if  (A,)  <  0.  by  the  assumption  of  the  proposition.  Finally,  either  S2  =  0  or  /(#*2)  + 
MiA„_i(/  -  1)  <  0,  so  that  S2(/(M2)  +  M2A„_|(i  -  D)  <  0.  This  completes  the  proof. 

THEOREM  1:  If  /;(/V,)  -  W+l)  ^  f?(Amax)  for  /  ^  M  then  A„+,(/)  A„(/)  Tor  n  > 

0,  1  <  /  ^  M  +  N. 

PROOF.  For  n  =  0,  we  need  A 1  ( / )  ^  0.  We  have 

(A  +  «)  F,(/)  =  max  [hiN)  -  N,g(\)  -  S/(/a)}  =  h(N,)  -  N„ g(Amax). 

S.  \ .  fi 

so  that  A| (/)  =  0  for  1  <  A/ and  A(</>  =  h(N,)  —  li(N,_ ()  +  jf(Amax)  ^  0  for  /  >  M.  Now 

assume  A „ ( / )  ^  A, ,-)(/)  for  each  i,  and  let  (S|,  A|.  and  (S2,  A2.  /a2)  be  optimal  for 

(n  +  1.  1 )  and  ( n,  i  -  1),  respectively.  Now  we  have 

(A  +  a)(\„+\(i)  —  A„(/))  ^  J„  +  i(i.  S |,  A|.  fi|)  —  J„(i.  S A|.  fi.|) 

—  y„+1(/  —  1.  s2.  a2.  /u2)  +  /,(/  —  1.  s2,  Ai,  |i2) 

=  (AT, A,  +  A/,y) (A„(/  +  1)  -  A„_|(t  +  1» 

+  S2M2(A„(/  -  1)  -  A„_,(/  -  D) 

+  (A  -  iV,_,A2  —  M,-\y  ~  S|/a|)(A„(/)  —  A „ _  1  ( / ) )  ^  0. 
by  the  induction  hypothesis. 

COROLLARY  1:  If  /» (iV,)  -  h(N,+x)  >  g(Amax)  for  /  ^  A/,  then  A„(t)  is  decreasing  in  n 
for  0  ^  /  <  M  +  N  -  1,  and  fi„(i)  and  S„(i)  are  increasing  in  n  for  1  <  /  ^  M  +  N. 

PROOF.  If  A,,  is  the  largest  value  of  A  which  maximizes  the  function  k  (A;  c)  -  -  #f (A)  + 
Ac.  then  it  is  easy  to  show  that  A,  is  increasing  in  c.  If  we  let  c  =  A „(/  +  1),  the  monotonicity 
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of  A „(/)  in  n  follows  from  Theorem  1.  The  proof  that  n„(i)  increases  in  n  is  similar.  To  show 
that  S„(i)  increases  with  n ,  it  suffices  to  show  that  /(/*„(/))  +  ,(/)  decreases  with  r/. 

We  have 

+  M/i+i(')A„(/)  ^  /(M(1(i))  +  jin(/)A„(t) 

^  M „(/ ) A„_| (/ ) . 

from  Theorem  1  and  the  definition  of  *i„  +  )(/). 

Corollary  1  slates  that  when  there  is  more  time  remaining,  the  system  works  harder,  in 
the  sense  of  more  maintenance  (lower  A)  and  more  repair  (higher  Sand  /x).  The  next  theorem 
and  its  corollary  show  that  for  a  given  n,  the  system  works  harder  when  more  machines  are  bro¬ 
ken  down.  From  now  on  (except  for  Theorem  3  and  Corollary  3),  we  will  need  to  assume  that 
It  is  a  linear  function,  so  that  lt(j)  =  jr ,  0  <  j  ^  A. 

THEOREM  2:  Assume  g  is  convex,  differentiable,  and  satisfies  -r  +  #(Amm)  - 

A’'Um,n)Amin  <  0  for  /  >  M.  Then  \„(i  +  1)  <  A„(i)  for  n  >  0,  1  <  /  <  M  +  A  -  1. 

PROOF:  Before  proceeding,  we  note  that  the  above  conditions  imply  //(A,)  -  /;(/V,+,)  ^ 
for  i  Js  M,  so  that  the  results  of  Proposition  1  and  Corollary  1  are  in  effect.  This  fol¬ 
lows  since  g'  <  0. 

For  n  =  0,  the  result  is  trivial.  Now  assume  inductively  that  A„_! (/  +  1)  <  A ,  1 

<  /  ^  M  +  A  -  1  and  that  r  -  g(\ „(/))  +  A„(/)A„_,(/  +  1)  >  0  for  /  >  M  +  1 . 

First  we  show  that  A„(/  +  1)  <  A„(/).  The  difference  (A  +  a)  ( A „ ( /  +  1)  -  A„(/))  is 
the  sum  of  four  expressions:  the  terms  with  rand  A,  the  terms  with  >,  the  terms  with  Sand  n, 
and  the  terms  with  A.  We  look  at  each  of  these  separately. 

(i)  (r.A  terms)  =  A  -  A,+I(r  -  j?(A„(/  +  1))  +  A„(/  +  +  2)) 

-  2 A, ( r  -  g(\ „(/))  +  A „ (/) A„ _i (/  +  D) 

+  A,_,(r  -  jf(A „(/  -  1))  +  A„(i  -  1)A, ,_,(/)). 

If  /  <  M  -  1 , 

A  <  AA„(/  +  1 ) ( A „ _ , ( /  +  2)  -  A „ _ , ( /  +  1)) 

-  AA„(/  -  1)(A„_,(/  +  1)  -  A „_,(/') ) 

<  —  A  A  „  ( /  -  1 )  ( A  „  _  |  ( /  +  1)  -  A, ,_,(/)). 

If  /  5*  M  +  1 , 

A  <  A,+1A„(/  +  1)(A„_,(/  +  2)  -  A ,,_)(/  +  D) 

-  A,_,A„(/  -  l)(Afl_|(/  +  1)  -  A, ,_,(/)) 

<  -  A,_,A„(/  -  n<A„_,((  +  1)  -  A„_,(i'))t 
since  2A,  —  A,+|  +  A,_t. 

If  i  -  M, 

A  -  (A  -  1 ) (r  -  g(\n(M  +  1))  +  A„( A/  -i-  1)A„.,(A/  +  2)) 

-  2 A(r  -  j?(A„(A/))  -(-  A„(A/)A„_|(A/  +  1)) 
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+  N(r  -  -  D)  +  A„(M  -  1) A„_,( A/)) 

<  N\JM  +  1)(A„_|(M  +  2)  -  A„_|(A/  +  D)  | 

-  NK„(M  -  1)(A„_| (M  +  1)  -  A,,-, (AD)  l 

^  -N A„(M  -  1HA„_,(M  +  \)  -  A „_,(*#)). 
where  the  first  inequality  follows  from  the  induction  assumption. 

(ii)  (y  terms)  -  M  HyA„_,(/  +  2)  -  2A/,yA„_,(/  +  1)  +  M ,_,yA„_1(/)  j 

0  if  /  ^  M  +  1 
yA„_,(/)  <  0  if  *  -  M 
-  (AT  -  l)y(A, ,_,(/+  2)  -  A„_,(/  +  D) 

-  ( Af  +  l)y(A„_,(/  +  1)  -  A, ,_,(/)) 

<  -  (Af,  +  Dy(A„_,(i  +  D  -  A, ,_,(/))  if  i  ^  M  -  \. 

(iii)  (S.  m  terms)  =  B  =  -S„(/  +  1 ) (/(m „ ( /  +  D)  +  n„(i  +  1 ) A„_,(/  +  1)) 

+  2S, ,(/)(/(/*„(/'))  +m„(/)A„-iO)) 

-  S„(/  -  1 ) (/(m „ ( /  -  D)  +m„('  -  1)A„_|(/  -  D). 

From  the  induction  assumption,  it  follows  as  in  Corollary  1  that  S„(i)  is  increasing  in  i.  There¬ 
fore,  we  examine  expression  B  by  looking  at  the  following  four  cases. 

CASE  1.  S„(i  +  1)  =  0.  Then  B  =  0. 

CASE  2.  S„0  +  1)  =  /  +  1,  S„(/)  =  0.  Then 
B  “  —  (/  +  I )(./(/!„(/  +  1 ))  +  fx„((  +  1 ) A„_|(/  +  1)) 

=  —  0  +  l)/i„(/  +  1)(A„_|(/  +  1)  —  A, ,_)(/)) 

-  (/  +  1  )(/(/*„(/  +  D)  -I-  m„('  +  DA, ,_,(/)) 

<  -(/  +  1)M„('  +  D(An_,(/  +  D  -  A „_,(/) ) 

—  (/  +  »(/(/*,(/))  +m«(')A,-i(<)) 

^  —  (/’  +  \)n„(.i  +  D(A„_t(/  +  1)  —  A„_i(/)), 
since  S„(i)  =  0  implies  /(/*«(/))  +  m«(')A„_,(/)  2*  0. 

CASE  3.  S„(/)  =  /,  S„(/  -  D  =  0.  Then 
B  «£  -(/  +  UM„(/  +  D(A„_|(/  +  D  -  A„_|(/)) 

+  (/  -  \)(/{fjL„{i))  4-  /i„(/)A„_|(/)) 

<  -(/  +  l)M„(/  +  D(A„_,(/  +  D  -  A „_,(/)) . 
since  S„(i)  =  /  implies /(/*„('))  +  m»(')A„_|(/)  <  0. 

CASE  4.  Sn(i  —  D  —  /  —  1.  In  this  case 
B  ^  -(/  +  !)/*„('  +  1)(A„_|(/  +  1)  -  A, ,_,(/)) 

+  (/  -  l)ji„0  -  1 ) ( A „ _ | ( / )  -  A „ _ | ( /  -  D) 

<  -(/  +  1  +  1 ) (A„_, (/  +  1)  -  A „ _ | ( / ) ) . 
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(iv)  (A  terms)  =  A(A„ .,(/  +  1)  -  A„_i(/)).  The  only  thing  left  to  do  now  is  to  combine  the 

above  "leftover"  terms,  all  negative  multiples  of  A „ _ j < /  +  1)  —  A „ _ | ( / ) ,  with  this  A  term 

to  establish  the  result. 

We  finally  show  that  r—  tf(\,1+i(/))  +  A„  +  |  ( / ) A „ ( /  +  1)  >  0  for  /  ^  AT  +  1  to  establish 
the  induction.  We  have,  for  /  ^  M  +  1, 

A„+1 0)A„(f  +1)  -  (A„  +  ,(/)/(A  +  «))  (A,tl(r  -  g(A„(/  +  D) 

+  A„(/  +  1)A„_,(>  +2))  -  SJi  +  1  )(/W'  +  D)  +M„('  +  DA„_,(/  + 1)) 
+  S„(/)(/(m„0))  +  4„(/)A„_,(/))  -  A,(r  -  g(k„(i)) 

+  A  „  ( / )  A  „  _  |  ( /  +  1))  +  A  A  „  _  |  ( /  +  1)1- 

By  induction,  r  -  g(kn(i  +  D)  +  A„(/  +  1)A„_,(/  +  2)  ^  0.  By  an  argument  similar  to 
above,  it  is  easy  to  show  that  the  combined  terms  with  S  and  /a  are  non-negative.  Therefore, 
we  have 

A„+1(/)A„(/  +  1)  ^  (A„+,(/)/(A  +  «))  [ — iV, ( r  -  g(A„(/)) 

+  (A  -  N,k „(/))A„_,(/  +  1)] 

^  (An+|(/)/(A  +  a))[—Nl(r  -  g(k„(i)) 

+  (A  -  jV,A„(/))(— r  -I-  g(A„(/)))/A„(/)] 

=  (A/ (A  +  «))A„  +  ,(/)  [(— r  +  g(k„(i)))/\„(i)] 

For  fixed  /,  consider  the  graph  of  the  function  A  (A)  =  —r  +  g(k).  By  the  condition  of 
the  theorem,  the  linear  extension  of  this  graph  from  Am,n  back  to  A  =  0  stays  nonpositive.  II 
we  recognize  A  ( A ) / A  as  the  slope  of  the  line  from  (0,  0)  to  (A.  A  (A)),  then  the  convexity  of  g 
and  the  fact  that  A„  +  ,(/)  <  A„(/)  imply  that  A  (A  „ +1  (/ )  )/A  „ ( /)  <  A  ( A  „  ( / ) )  /  A  „  ( / ) .  There¬ 
fore, 

A„+I(/)A„(/  +  1)  ^  (A/ (A  +  re))(-r  +  g(A„+i(/))) 

>  -T  +  A(An+|(/)), 
and  the  proof  is  complete. 

The  following  corollary  follows  from  Theorem  2  exactly  as  Corollary  1  follows  from 
Theorem  1. 

COROLLARY  2:  Given  the  same  conditions  as  in  Theorem  2,  A „ ( / )  is  decreasing  in  / 
and  S„(i)  and  /*„(/)  are  increasing  in  i. 

Before  proceeding,  we  remark  that  the  differentiability  of  g  assumed  in  Theorem  2  is 
obviously  not  necessary.  All  we  need  is  that  a  linear  (or  convex)  extension  of  g  from  Amln  back 
to  A  =  0  remains  sufficiently  small,  and  this  condition  is  most  easily  stated  in  terms  of  g'  when 
exists. 

We  now  examine  how  the  optimal  actions  behave  as  functions  of  re,  AT,  and  N.  The  addi¬ 
tional  subscripts  will  show  the  dependence  of  the  various  quantities  upon  these  parameters. 

THEOREM  3:  If  h(S,)  -  /i(N,+l)  >  jf(Amln)  for  /  >  A/,  then  A„  „,(/)  ^  A„.„}(/)  for 
discount  factors  re,  <  n2,  and  all  n  ^  0,  1  <  M  +  N  -  1.  (In  this  theorem  and  the  fol¬ 

lowing  corollary,  /;  need  not  be  linear.) 
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PROOF:  For  n  =  0,  the  result  is  trivial.  Assuming  it  is  Uue  for  n  —  1,  let  F„„(/)  = 

/.('■  S„  .„(/),  fi„J' ))  and  write  A„  ,t|(/)  =  F„„(/)  -  F„  „(/  -  ]).  Then  A„,t|(/)  - 

•X,  «,(')  “  X, «, A  +  «,)  -  A„  „2(/)/( A  +  «2).  If  we  can  show  that  0  ^  A„„,(/)  > 

A „  „ , ( ' )  for  each  /,  then  the  result  will  follow  since  1  /( A  +  «,)  >  1  /( A  +  «,).  But  A„„(/)  < 

0  from  Proposition  1,  and  ^  A„  „ t ( / )  by  a  proof  almost  identical  to  the  proof  of 

Theorem  1 

COROLLARY  3:  If  //(AO  —  //(/V,  +  !)  ^  Jif < Amin)  for  /  ^  A/,  then  A ,,,,(/)  is  increasing  in 

<*  and  S„  „(i)  and  are  decreasing  in  «.  That  is,  the  more  the  future  is  discounted,  the 

less  the  system  works. 

The  following  theorem  and  its  corollary  say  that  if  we  compare  two  systems  in  states 
which  have  the  same  number  of  working  machines,  the  only  difference  being  that  one  system 
has  an  additional  spare,  then  the  system  with  less  spares  will  work  harder. 

THEOREM  4:  Consider  two  systems  which  are  identical  in  every  respect  except  that  one 
has  M  spares  while  the  other  has  M  +  1  spares.  Assume  that  the  conditions  of  Theorem  2 

hold  for  each  system.  Then  A„  u(i)  ^  A„.w+1(/  +  1)  for  n  >  0,  1  <  /  <  M  +  N,  if  A  is 

redefined  as  A  —  A/Amax  +  ( M  +  l)y  +  (A '  +  M  +  l)jxmax  for  each  system. 

PROOF.  We  only  sketch  the  proof,  since  the  details  are  similar  to  those  in  Theorem  2. 

First,  notice  that  A/.  w  =  A/,  +  t  w+!  and  M,  it  —  M,  +  ]  w+1,  since  the  only  difference  between 

state  /  in  the  first  system  and  state  /  +  1  in  the  second  system  is  the  one  extra  machine  in 

repair.  This  observation  allows  us  to  group  the  A  and  y  terms  of  A„  w(/)  -  A„  w+l(/  +  1)  in 
the  obvious  manner,  while  the  r  terms  cancel.  Finally,  for  the  5,/x  terms,  the  observation  that 
S„  u(i  -  1)  =  /  —  1  or  S,„/+l(/  +  1)  =  /  +  1  implies  S„  w(/)  =  /  and  that  S„  M+1(/)  =  / 
implies  S„  w+|(/  +  1)  =  /  +  1  defines  the  possible  cases  we  need  to  consider.  From  this  point, 
the  proof  is  entirely  analogous  to  the  proof  of  Theorem  2. 

COROLLARY  4:  Given  the  same  conditions  as  in  Theorem  4,  A„  W4.,(/  +  1)  ^  A„  M(i), 
M »/(/)  ^  m.i  w+|('  +  1),  and  if  S„  n+\(.i  +  1)  =  /  +  1,  then  S„  \/(/)  =  /. 

We  next  consider  two  systems  with  the  same  number  of  spares,  but  where  one  system  has 
one  more  working  machine  than  the  other.  If  we  compare  two  states  where  each  system  has 
the  same  number  of  machines  in  repair,  then  we  find  the  possibly  surprising  result  that  the  sys¬ 
tem  with  more  working  machines  works  harder. 

THEOREM  5:  Consider  two  systems  which  are  identical  in  every  respect  except  that  the 
first  has  N  working  machines  while  the  second  has  N  -I-  1  working  machines.  Assume  that  the 
conditions  of  Theorem  2  hold  for  each  system.  Then  A„  v+|(/)  ^  A „.*(/')  for  n  >  0,  1  <  /  C 
M  +  /V,  if  A  is  redefined  as  A  =  (N  +  1)  Amax  +  My  +  (M  +  N  +  1)  /umax  for  each  system. 

PROOF:  Again  we  only  sketch  the  proof.  First  we  notice  that  N,  v+l  ~  N,  v  +  1  and 
M  v+i  =  A/,  v,  since  the  extra  machine  in  the  second  system  will  be  working  for  any  state  i. 
This  allows  us  to  group  the  A  and  y  terms  in  A„  V+1  (/)  -  A ll  N(i)  in  an  obvious  way  and  the  r 
terms  cancel.  Finally  we  deal  with  the  S,  fi  terms  in  the  standard  way  by  using  the  inequalities 
S„  v  ('  ~  1)  <  S„.v(/)  ^  S„ ,iV+i(/)  and  S„  \(i  —  1)  ^  S„jv+i('  —  1)  ^  S„  v+|(<)  to  establish 
the  possible  cases. 

COROLLARY  5:  Given  the  same  conditions  as  in  Theorem  5,  A „  v(/)  is  decreasing  in  A' 
and  n„  v(/)  and  S„  v0)  are  increasing  in  A/. 
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In  our  final  model  of  this  section,  we  examine  the  case  where  there  are  no  spares,  that  is, 
M  =  0.  Then  A,  =  A  -  i  and  M,  —  0.  Here  we  find  that  if  li  is  linear,  then  A„(/)  and  /*„(/) 
are  independent  of  /,  and  for  a  given  n  we  either  never  turn  on  servers,  or  we  always  turn  on 
one  server  for  each  broken  down  machine. 

THEOREM  6:  Assume  h(j)  =  jr ,  0  ^  j  ^  A,  and  M  =  0.  Then  A „ ( / )  =  A„  (/  +  1) 

for  n  ^  0  and  1  ^  /  <  A  —  1.  Therefore,  A „ ( / )  =  A„  (/  +  1)  and  /*„(/)  =  /x„  (/  +  1)  for  all 

/,  and  either  S„(/)  =  /  for  all  /  or  S„(i)  =  0  for  all  /. 

PROOF:  For  n  =  0,  the  result  is  trivial.  Assume  A „ _ i ( / )  =  A „ _ t ( /  +  1)  for  each  /.  This 
obviously  implies  that  A„(/)  and  are  constant  functions  of  /  and  that  S„(i)  =  /  for  all  /  or 
S„(/)  =  0  for  all  /.  Letting  A  =»  A „ ( / ) ,  /a  =  n„(i),  and  A  =  A„_|(/),  this  yields 

(A  +  a ) A „ ( / )  =  (A  -  i)  (r  -  #(A)  +  AA)  -  S„(/)  (g(^)  +  ^A) 

—  (A  -  /  +  1)  (r  -  #(A)  +  AA)  +  S„  (/  -  1)  (#(/u)  +  /u A)  +  AA 

=  -(/•-  ^(A)  +  AA)  -  (S„(/)  -  S„  (/  -  D)  +  ^A)  +  AA. 

Since  S„(i)  -  S„(i  -  1)  =  1  for  all  /  or  0  for  all  /,  we  see  that  A „(/)  is  constant  in  /,  and  the 
result  is  proved. 

As  a  corollary  to  this  result,  consider  the  two  systems  which  are  identical  except  that  their 
failure  rates.  A,  and  A2,  no  longer  considered  control  variables,  are  different.  That  is,  suppose 
each  working  machine  in  system  /  fails  at  rate  A,,  where  A,  <  A2.  Also  assume  M  =  0  and  that 
S  and  /x  are  the  only  control  variables.  Intuitively,  we  might  expect  that  the  system  with  the 
larger  failure  rate  would  work  harder  to  repair  to  its  machines.  This  result  turns  out  to  be  false. 
In  fact,  its  exact  opposite  is  tTue,  as  exhibited  in  the  following  corollary. 

COROLLARY  6:  Given  the  above  model  with  #( A)  =  0  and  li(j)  =  jr.  A, ,*,(/)  < 
A„. a2(/>  for  each  «  >  0,  1  <  /  <  A,  and  Amm  <  A,  <  A2  <  Amax.  Hence  /*».*,(')  > 
and  S„ .*,(/)  ^  S„ -Xj(i)  for  each  n  ^  1,  1  <  /  <  N. 

PROOF:  By  Theorem  6,  we  can  write  A „,*.(/)  =  A„,  and  n„K  (i)  —  n,,,  for  each  /. 
Therefore, 

(A  +  rt)(A„  X|(/)  -  A,,,//))  =  (A  -  / ) ( A , A „ _ ,  ,  -  A2A„_|  2) 

—  (A  —  /  +  1 ) ( A | A „ _ i  |  —  A2A„_|  2)  —  (S„  Ki(i)  —  S„  X)(/  —  1)) 

'  |)  +  M;,.  |A„_|  |)  +  (S„ .  *  2  ( / )  —  5„,  a2('  —  D) 

'  (/(Mw. 2)  +  Mn.2A„_|  2)  +  A ( A „ _ |  |  —  A„_i  2). 

By  induction,  (S„  X|(/)  -  S„  X|  (/  —  1))  -  (S„  K}{i)  -  S„  K2(i  —  1))  =  0  or  1.  Both  cases  may 
be  handled  as  in  Theorem  2  to  dispose  of  the  S,  n  terms  (after  possibly  combining  part  of 
them  with  the  A  term).  What  is  left  is 

“  (A  —  A  j )  ( A  „  _  j  j  —  A„_|  2)  +  (A2  —  A|)A„_|  j  ^  0. 

In  general,  if  M  >  0,  the  result  of  the  above  corollary  seems  to  be  partly  true  and  partly 
false.  For  most  small  states  (/  <  M),  it  seems  that  x,(/)  <  n„  Xj(/),  but  that  for  most  large 
states  (/  >  M),  fi„ ,X|(/)  >  /u„  Xj(/).  The  following  numerical  example,  whose  results  are  for 
the  original  continuous  time,  infinite  horizon  problem,  is  typical.  In  this  example,  A  —  2,  M 
”!,/■“  200,  <»  =  25,  and  /(/*)  =  .05fi2. 
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A  “  1  A  =  5  A  =  20 


I 

S(i ) 

M<») 

V(i) 

/ 

SO) 

ItO) 

U(/) 

/ 

SO) 

n(i) 

VO) 

0 

0 

0 

15.98 

0 

0 

0 

15.67 

0 

0 

0 

13.44 

1 

1 

2.39 

15.74 

1 

1 

8.35 

14.83 

1 

1 

16.00 

11.84 

2 

2 

33.65 

12.38 

2 

2 

32.73 

11.56 

2 

2 

29.20 

8.92 

3 

3 

37.84 

8.59 

3 

3 

36.34 

7.92 

3 

3 

31.12 

5.81 

We  end  this  section  by  noting  that  all  of  the  previous  results  are  true  for  the  infinite  hor¬ 
izon  continuous  time  problem.  This  follows  by  letting  n  —  °°  and  using  the  results  from  Lipp- 
man  (14].  Furthermore,  because  of  Theorem  1  and  the  results  of  Lippman  [15],  these  mono¬ 
tonicity  properties  also  hold  for  the  finite  horizon  continuous  time  problem.  To  state  these 
results,  we  will  drop  the  subscript  n  to  denote  the  infinite  horizon  problem,  and  we  will  substi¬ 
tute  t  for  n  to  denote  that  there  is  i  time  left  in  the  finite  horizon  problem. 

THEOREM  7:  Suppose  li(j)  =  jr,  ./'  ^  0,  and  the  assumptions  of  Theorem  2  hold.  Then 
for  the  infinite  horizon  problem,  V(i)  is  concave  and  decreasing  in  /,  n(i)  and  S(i)  are  increas¬ 
ing  in  /  and  M  and  are  decreasing  in  «,  and  A ( /)  is  decreasing  in  /  and  N  and  is  increasing  in  a. 
Also,  Am  +  |(/  +  1)  >  A if(i),  ^  Mvr  + 1 O'  +  1),  and  if  SM+I(i  +  1)  =  /  +  1,  then  SM(i) 

=  i.  For  the  finite  horizon  problem  with  t  time  remaining,  these  same  properties  hold  for 
F,(/),  A , ( / ) ,  S,(/),  and  n,(i).  Finally,  n,(i)  and  S,(i)  are  increasing  in  r,  and  A , ( /' )  is  decreas¬ 
ing  in  t. 

3.  SUMMARY  AND  POSSIBLE  EXTENSIONS 

In  this  paper  we  have  been  able  to  extend  previous  results  concerning  the  structure  of 
optimal  policies  for  the  machine  repair  problem.  Namely,  we  have  let  A  and  S,  as  well  as  n,  be 
control  variables  and  we  have  allowed  spares  to  fail  before  they  are  put  into  use.  There  are 
still,  however,  many  other  complications  of  this  model  which  are  worthy  of  further  study. 
Unfortunately,  the  ones  we  have  in  mind  will  probably  present  many  more  mathematical 
difficulties  than  the  model  we  have  examined  here. 

One  obvious  feature  which  would  be  good  to  include  is  the  presence  of  fixed  costs  for 
turning  on  or  turning  off  servers.  Since  there  has  been  only  limited  success  with  this  type  of 
queuing  control  in  other,  probably  less  complicated,  queuing  models,  we  are  not  optimistic 
about  success  here.  Another  possible  extension,  or  set  of  extensions,  is  to  complicate  the  prob¬ 
lem  in  such  a  way  that  there  are  two  or  more  state  variables.  For  example,  we  could  assume 
that  the  time  to  install  spares  is  an  exponential  random  variable,  or  that  failed  spares  go  to  their 
own  repair  facility,  or  that  working  machines  may  fail  in  more  than  one  way  and  that  each  type 
of  failure  requires  a  different  repair  facility  (see  [11],  [12]).  In  each  of  these  models,  the  state 
space  is  multidimensional,  which  serves  to  complicate  the  mathematics  immensely.  In  fact,  we 
have  investigated  these  models  quite  extensively.  Unfortunately,  aside  from  an  analogue  of 
Theorem  6,  not  any  of  even  the  most  intuitive  structural  results  could  be  proved  by  the  above 
induction  arguments.  At  the  same  time,  no  counterexamples  to  our  conjectures  were  found. 
Hopefully,  researchers  in  the  future  will  discover  how  (or  if)  Lippman's  "new  device  for 
exponential  queuing  systems"  is  able  to  solve  these  models  with  multidimensional  stale  spaces. 
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ABSTRACT 

In  this  paper  we  present  a  new  formulation  of  the  quadratic  assignment 
problem.  This  is  done  by  transforming  the  quadratic  objective  function  into  a 
linear  objective  function  by  introducing  a  number  of  new  variables  and  con¬ 
straints.  The  resulting  problem  is  a  0-1  linear  integer  program  with  a  highly 
specialized  structure.  This  permits  the  use  of  the  partitioning  scheme  of 
Benders  where  only  the  original  variables  need  be  considered.  The  algorithm 
described  thus  iterates  between  two  problems.  The  master  problem  is  a  pure 
0-1  integer  program,  and  the  subproblem  is  a  transportation  problem  whose  op¬ 
timal  solution  is  shown  to  be  readily  available  from  the  master  problem  in 
closed  form.  Computational  experience  on  problems  available  in  the  literature 
is  provided. 


1.  INTRODUCTION 

In  this  paper,  we  revisit  the  quadratic  assignment  problem  which  was  first  formulated  by 
Koopmans  and  Beckmann  [19]  for  assigning  m  indivisible  entities,  called  facilities,  to  m  mutu¬ 
ally  exclusive  locations.  More  specifically,  we  consider  the  following  most  general  formulation 
of  this  problem  as  introduced  by  Graves  and  Whinston  [14], 

m  m  m  m  m  m  p 

QAP  1 :  minimize  £  £  au  xtj  +  £  £  £  £  £  bfa  xu  xkl 

/-I  y-1  /-l  y-1  *-l  /-I  1 

m  m  m  m  p 

+  £  £  £  £  £  Jik  dji  Xjj  xk, 

/-|  j-\  *_|  /_!  „_| 

subject  to 

*€  XA  =  |(xn,  . . .  ,  xmm)\  £  xtj  -  1,  j  -  1,  ...  m,  £  Xij  =  1, 

l  <-l  y-i 

/'  =  1,  . . .  ,  m,  x0  *=  0,  1,  ij  =1,  ....  ml 


Here  there  are  p  products  that  flow  among  the  m  facilities.  Particularly,  f?k  is  the  amount  of 
flow  of  product  n  from  facility  i  to  facility  k  and  df/  is  a  distance  measure  from  location  j  to 
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location  /  when  transporting  product  n.  Further,  an  is  the  fixed  cost  of  locating  facility  i  at  loca¬ 
tion  j,  and  b"u  is  a  fixed  cost  for  product  n  dependent  on  a  pair  of  assignments,  viz,  facility  i  to 
location  y  and  facility  k  to  location  /.  Note,  that  without  loss  in  generality,  we  can  take  b"lk/  =  0 
if  i  -  k  or  j  -  /  and  also,  f”  -  </"  =  0  for  /,  j  -  1 . m,  n  =  1 ,  ....  p. 

Using  a  simple  transformation  introduced  by  Lawler  [22],  and  extended  by  Pierce  and 
Crowston  [25],  the  above  problem  may  be  written  as: 


QAP  2: 


minimize 

xt  X. 


rn  rn  rn  rn 

^  =  I  I  I  I  %i  xij 


■I  j-l  k- 1  /-I 


Xk, 


where, 

il  b"jki  +  ]£  f!i  dji  if  i  *  k  or  j  *  I 

n- 1  n- 1 

fly  otherwise 

and  where  a  superscript  /,  will  throughout  this  paper,  denote  the  transpose  operation.  However, 
we  will  find  it  more  convenient  to  use  the  following  transformation: 

-  Tzrrr  +  1  (b"«'  + +  t  c/s  n  +  /s  ^). 

(1.3)  /  =  1,  . . .  ,  m  —  1 ,  k  =  /  +  1 . m,  /,  j  =  1 ,  ,  m,  l  ^  j. 

It  may  be  easily  seen  that  this  transformation  leads  to  the  equivalent  problem 

m~ 1  m  m  m 

QAP  3:  minimize  I  I  I  I  cijkl  Xij  xkl 

,-|  y_I 

!*i 


(1.2) 


Sijkl 


The  quadratic  assignment  problem  has  enjoyed  great  popularity  mostly  because  of  its  ver¬ 
satile  applicability,  but  partly  because  of  its  insurmountable  resistance  to  efficient  solution  tech¬ 
niques.  For  a  review  on  quadratic  assignment  problems,  we  refer  the  reader  to  Francis  and 
White  [6],  Gaschutz  and  Ahrens  [8],  Gavett  and  Plyter  [9],  Gilmore  [It],  Graves  and  Whins- 
ton  [14],  Hanan  and  Kurtzberg  [15],  Heider  [16],  Hillier  and  Connors  [17],  Koopmans  and 
Beckmann  [19],  Land  [20],  Lawler  [22],  and  Pierce  and  Crowston  [25],  For  computational  stu¬ 
dies  on  this  problem,  we  refer  the  reader  to  Nugent,  Vollmann  and  Ruml  [23]  and,  more 
recently,  to  an  excellent  study  by  Burkard  and  Stratmann  [2], 

In  the  following  sections,  we  first  employ  a  transformation  which  converts  problem  QAP  3 
into  a  mixed  integer  linear  program  and  thus  makes  it  most  amenable  to  the  partitioning 
scheme  of  Benders  [1].  This  latter  scheme  decomposes  the  problem  into  a  linear  integer  mas¬ 
ter  program  and  several  linear  subproblems  which  are  essentially  transportation  problems.  The 
solution  to  the  subproblems  is  shown  to  be  readily  available  in  closed  form.  We  then  develop 
an  approach  to  the  master  problem.  Finally,  computational  results  and  certain  relevant  conclud¬ 
ing  remarks  are  presented. 

2.  REFORMULATING  THE  QUADRATIC  ASSIGNMENT  PROBLEM 

For  the  purpose  of  this  development,  let  us  introduce  m7(m  -  1 ) 2/2  new  variables, 

(21)  y,tki  “  x,i  xk,  for  /  =  1 . m  -  1,  k  -  /  +  1 . m.  j,  I  -  1 . m.  j  *  / 

In  addition,  introducing  2m  (m  -  1)  new  linear  constraints,  we  formulate  the  following  linear 
mixed  integer  problem  which  is  then  shown  to  be  equivalent  to  problem  QAP  3. 
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m  —  1  m  m  m 

QAP  4:  minimize  I  I  I  I  cijki  y„ki 

,.i  ,-i  *_,+i  /_i 

i*i 


subject  to 


(2.2) 

m  m 

Z  Z  y<ju  -(»!-  i)  x,j  -  0 

*-/+!  l-l 
l*J 

for 

/  —  1.  . . 

.  ,  /n  -  1,  j  -  1, 

....  m 

(2.3) 

k  —  1  m 

Z  Z  y>iki  -  xki =  0 

for 

k  -  2,  . 

. .  .  m.  1  =  1 . 

-  m 

i*i 

(2.4) 

m 

Z  *<>  =  1 

/-i 

for 

/  -  1,  .. 

, .  ,  m 

(2.5) 

m 

Z  x'i  = 1 

/-l 

for 

j  -  1. 

.  %  m 

(2.6) 

x„  binary 

for 

/J  -  1.  . 

. .  ,  m 

(2.7) 

-y,iki  >  -  1  for  /  =  1 . m  - 

1,  k 

=  /  +  1,  . 

. .  ,  mj,/  =  1,  . . . 

.  m,  j  *  1 

(2.8) 

ylik,  Z  0  for  /  —  1 . m  -  1 , 

k  = 

/  +  1,  .. 

.  ,  m,j,l  *  1 . 

m.j  1 

Note  that  problem  QAP  4  has  m2  integer 

and 

m2(m  - 

l)2/2  continuous 

variables  and 

linear  constraints.  As  opposed  to  this,  Lawler’s  [22]  linear  integer  formulation  involves 
m2  +  m4  integer  variables  and  m4  +  2m  +  1  constraints.  A  more  recent  linearization  technique 
due  to  Kaufman  and  Broeckx  [18]  results  in  a  mixed  integer  program  with  m 2  zero-one  and  m2 
continuous  variables  and  m 2  +  2 m  constraints.  In  any  case,  we  will  be  demonstrating  how 
Problem  QAP  4  can  be  solved  by  exploiting  its  structure  and  handling  explicitly  only  its  m 2 
zero-one  variables  and  its  2m  linear  assignment  constraints  (2.4),  (2.5). 

Before  proceeding  to  propose  a  solution  technique  for  problem  QAP  4  we  first  establish 
that  every  feasible  solution  to  Problem  QAP  4  must  satisfy  Equation  (2.1).  As  a  consequence 
of  this  result.  Theorem  1  stating  that  Problems  QAP  3  and  QAP  4  are  equivalent,  follows. 

LEMMA  1:  Let  (x,y)  be  any  feasible  solution  to  Problem  QAP  4.  Then,  ylikl  -  xuxkt  for 
i  =»  1 . m  —  1 ,  k  —  /  +  1 . m,  ./,  /  =  1 ,  ....  m,  j  ^  /. 

PROOF.  First  of  all,  note  that  for  any  p.q.r.s,  p  <  r.q  ^  s,  xwxrv  -  0  implies  from 
Equation  (2.2),  (2.3)  that  yw„  —  0.  Now  consider  p.q.r.s.p  <  r.q  ^  s  such  that  xrq  -  xrs  —  1. 
It  suffices  to  show  that  this  implies  yw„  =  1.  Since  xrs  =  1,  then  by  Equation  (2.3),  we  have, 

r— 1  m 

(2.9)  .  X  Z  y»n  -  r  -  1 

.-I  /-t 

i*s 

But  since  x  satisfies  the  assignment  constraints  (2.4)  through  (2.6),  and  since  xrs  -  1,  there  are 
precisely  r  -  1  nonzero  variables  xtj  for  i  <  r,  j  &  s.  Since  xf/  -  0  implies  yllrs  —  0  from 
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above,  this  in  turn  means  that  there  are  at  most  r  —  1  nonzero  variables  y,in  for  /  <  r,  j  ^  s. 
However,  in  view  of  Equations  (2.7),  (2.8),  (2.9),  we  must  have  precisely  r  —  1  unit  variables 
y,irs .  each  corresponding  to  x„  -  1  for  /'  <  r,  j  *  s.  In  particular,  since  xw  -  1, 

p  <  r,  q  5*  s,  then  ywn  —  1 .  This  completes  the  proof. 

THEOREM  1:  Problems  QAP  3  and  QAP  4  are  equivalent  in  the  following  sense.  For 
every  feasible  solution  to  Problem  QAP  3  there  corresponds  a  feasible  solution  to  QAP  4  with 
the  same  objective  function  value  and  vice  versa. 


PROOF:  Let  or  be  a  feasible  solution  to  Problem  QAP  3  and  let  us  define  the  vector  y 
through  Equation  (2.1).  We  will  show  that  (x,y)  qualifies  for  the  corresponding  solution  we  are 
seeking  to  Problem  QAP  4.  Now,  Equation  (2.1)  implies  that 


(2.10) 


m  m  m  tn 

Z  Z  y<iki  “  Z  Z  x'ixk!  “  0  for  /  -  1 ,  . . .  ,  m  -  1 ,  j  -  1 ,  . . .  ,  m 

A-/  +  I  /-I  A-/  +  1  /-I 

i*i  i*  i 


If  for  any  such  i.j,  we  have  x,,  -  0,  then  from  Equation  (2.10),  Equation  (2.2)  must  hold.  On 

m 

the  other  hand,  if  x„  —  1,  then  since  we  must  have  x,,,  -  0  for  each  k  >  i,  we  get  £  xki  = 


0.  Thus,  using  Equation  (2.4),  we  have. 


A--/  +  I 


m  m  m  m  m  m  m 

(2.11)  £  X  Xnxk!  ”  X  Xx*/+  X  **/“  X  X  *u  “  (*  -  ') 

k-i  +  l  I- 1  *-,  +  l  /-I  *-i  +  l  A-/  +  I  l-\ 

I*  i  l*i 


Thus,  from  Equations  (2.10),  (2.11),  again  Equation  (2.2)  holds.  Similarly,  one  may  show  that 
Equations  (2.3)  holds.  Also,  from  the  definition  of  y  through  Equation  (2.1),  we  have  that 
Equations  (2.7),  (2.8)  hold  and  that  the  objective  function  values  of  Problems  QAP  3  and  QAP 
4  are  equal. 


Conversely,  let  (x,y)  be  a  feasible  solution  to  QAP  4.  We  will  show  that  x  qualifies  for 
the  corresponding  solution  we  are  seeking  to  Problem  QAP  3.  Clearly,  x  is  feasible  to  QAP  3. 
That  the  objective  function  values  are  equal  follows  from  Lemma  1  which  asserts  that  Equation 
(2.1)  holds.  This  completes  the  proof. 


We  now  proceed  to  develop  a  solution  procedure  for  Problem  QAP  4.  This  technique 
exploits  the  following  special  structure  of  the  problem.  The  variables  x  are  restricted  to  be 
extreme  points  of  the  assignment  polytope,  or  points  in  the  set  XA.  For  each  such  point,  the 
resulting  problem  in  j  is  a  transportation  problem  whose  solution,  as  seen  above,  is  given 
through  Equation  (2.1).  This  structure  is  most  efficiently  exploited  by  Benders’  Decomposition 
scheme  111. 


3.  APPLICATION  OF  THE  PARTITIONING  SCHEME  OF  BENDERS. 

In  this  section,  we  will  decompose  Problem  QAP  4  into  a  linear  integer  master  problem  in 
m2  zero-one  variables  and  a  linear  subproblem,  using  the  method  due  to  Benders  (see  Benders 
Ill  and  Lasdon  [21]).  At  each  iteration,  the  master  problem  generates  a  point  of  XA ,  based  on 
which  a  suitable  subproblem  is  solved  to  generate  a  cut-constraint.  This  cut  is  then  appended 
to  the  other  constraints  in  the  master  problem  and  the  solution  to  the  latter  is  updated.  The 
procedure  hence  iterates  between  the  master  problem  and  the  subproblem  until  a  suitable  ter¬ 
mination  criterion  is  met  in  a  finite  number  of  steps. 
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To  conduct  such  a  decomposition,  observe  that  for  a  fixed  jc€  A"*,  problem  QAP  4  is  a 
t/-a"*po/'a‘‘on  Problem  '1  ‘he  .^variables  over  a  set,  say,  Y(x)  defined  by  constraints  (2.2), 
(2.3),  (2.7)  and  (2.8).  Thus,  we  may  write  QAP  4  as 


QAP  4.  minimize 


vC  X. 


tn  —  I  w  m  m 

minimum  I  I  I  I 

cnk -i  y.iki 

ru’  -i  /-i  k>,  i *, 


For  the  inner  minimization  problem  above,  letting  u,h  /  =  1 . m  —  1  j  —  1  m  be  the 

dual  variables  associated  with  constraints  (2.2),  v<„  k  -  2 . m,  I  =  1.  ihose  with 

2.3  and  wllU,  /  -  1 . m  -  1.  k  -  /  +  1 . m.  !J  -  \ . m,  /  *  j  those  with 

U./),  we  may  use  the  linear  programming  dual  to  rewrite  the  above  problem  as 

QAP  4.  minimize  (5/>(x)l 

where,  for  a  fixed  x€XA,  we  have, 

SP(x):  maximize!  I  I  (m  -  /)  «„  *„+££(*-  1)  V(W  xu 

/-I  A-2  /-I 

w  -  l  »n  m  m 

-  I  I  I  I  *,*, 

/-I  A-.  +  l  I- 1  /-I 

i*l 


subject  to 

uu  v*/  —  wijkl  ^  ty*/ 

2s  0| 


W,jk! 


for  /  =  1 . m  -  1,  k  -  /  +  1, 


m,  j,  /  =  1 . m 

j*i 


Now,  using  the  standard  transformation  or  replacing  unrestricted  variables  by  the  difference  of 
two  nonnegative  variables,  one  may  show  that  SP(x)  attains  an  optimal  solution  at  an  extreme 

point  of  the  resulting  constraint  set.  Thus,  letting  (//'’.  v',  w").  />€£=(] . £)  be  the 

finite  number  of  points  corresponding  to  the  extreme  points  of  this  set,  then  SP(  v),  and  hence 
QAP  4  may  be  rewritten  with  obvious  notation  as 


QAP  4:  minimize  f  maximum  (a'  v  -  «„)1 
v€  V,  lp€  ZT-Il  P\  '  "  J 

Finally,  the  above  problem  may  be  represented  as  the  master  problem 
MP(E):  minimize  z 


subject  to  z  >  a'px  -  ar  for  each  p6  E 
x€XA 


The  scheme  of  Benders’  asserts  that  one  need  not  generate  the  entire  set  £,  but  rather 
generate  suitable  elements  of  £  as  and  when  needed.  Thus,  at  any  particular  stage  r,  letting 

1  ^  }Er'  =  r  ^  p'  and  denoting  the  corresponding  relaxed  master  program  by 
M£(£r),  the  following  scheme  is  validated  by  Benders. 


Initialization: 


Let  z'  =  -oo,  choose  an  arbitrary,  preferably  good, 
solution  .v  €T^,  set  r  =  1  and  £q  =  $,  and  go  to  Step  2. 


Step  1:  Solve  the  master  problem  */£(£,).  Let  (*'+l,  z'  +  l)  be  an  optimal  solution. 

Increment  r  by  one  and  go  to  Step  2. 
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Step  2.  Solve  the  subproblem  SP(.x  0.  Let  («',  v',  w 0  be  an  optimal  extreme  point 

solution  with  objective  function  F.  If  F  —  z\  terminate  with  x  '  as  an  optimal 

solution  to  the  quadratic  assignment  problem.  Otherwise,  z  >  z\  and  add 

the  cut  ^  , 

2  >  arx  -  ar 

to  the  current  master  problem.  Letting  E,  -  U  (r),  go  to  Step  1. 


The  cardinality  of  E  being  finite,  the  above  procedure  is  finitely  convergent  since  if  any  element 
of  £is  re-generated  then  the  termination  criterion  of  Step  2  is  necessarily  met  (see  [1]  or  [21]). 
Over  the  next  two  subsections,  we  will  now  discuss  the  solution  strategies  for  the  master  prob¬ 
lem  and  the  subproblem. 


3.1  On  Solving  the  Subproblem 


Recall  that  for  the  sake  of  finiteness  of  Benders’  scheme,  we  need  to  generate  an  optimal 
extreme  point  solution  to  problem  SP(x  0  where  all  unrestricted  variables  are  transformed  into 
nonnegative  variables.  Writing  the  Kuhn  Tucker  conditions  for  problem  SP(x  0  and  using  a 
rather  elaborate  proof,  one  can  show  (see  [27])  that  the  following  solution  qualifies,  as  such  an 
extreme  point  solution. 


ur+l  = 
“v 


maximum  [c,,*,] 


(k.l):  x'kr  I 
k>i 


if  x\j=  1 

for  i  =  1 . m  —  lj  =  1 ,  ....  m 


minimum  l cukl  -  v^+1]  if  x'j  =  0 

(*,/):*>  i 


vr+l 

vw 


o 

ii 

1 

1 

minimum 

minimum  {cy*,  -  u'+1},  minimum  |cyW) 

(i.j):x'j- 1  U.j):x'—0 

r 

i<k  i<k 

»'jk'  = 


for  k  =  2 . m,l  =  1 . m 


0  if  x^  =  0 

for  /  =  1 . m  —  1  ,k  =  i  +  1 ,  ....  m,  j,l  =  1 ,  . . .  ,  m,l  ^  j. 

«,7+1  -  C,jkl  if  XU  *kl  =  1 


We  briefly  note  at  this  point  that  one  may  write  out  an  alternative  optimal  extreme  point  solu¬ 
tion  by  simply  interchanging  the  roles  of  the  variables  u  and  v  above.  Through  computational 
testing  we  found  that,  depending  on  the  data  structure,  one  of  these  solutions  yields  cuts  that 
perform  significantly  better  than  those  obtained  from  the  other  solution. 


3.2  On  Solving  the  Master  Problem 

First  of  all,  note  that  the  master  problem  need  not  be  solved  exactly  at  each  iteration. 
Recall  that  in  using  Benders’  decomposition,  we  are  merely  interested  in  generating  elements  of 

£  =  [1 . P)  until  such  time  as  the  value  of  the  master  problem  equals  the  current  best 

value  of  Problem  QAP  4.  Moreover,  while  accomplishing  this,  we  wish  to  ensure  for  the  sake 
of  finiteness  that  we  do  not  regenerate  any  point  of  E  until  the  termination  criterion  is  met. 
Now  suppose  we  find  a  possibly  suboptimal  solution  (x'+1,  fr+1)  to  the  master  problem  at  stage 
r.  If  necessary,  we  then  update  the  current  best  solution  using  xr+l. 

Let  the  current  best  value  be  z  and  suppose  that  i'+l  <  F.  Then  clearly,  the  exact  solu¬ 
tion  value  zr+l  of  the  master  problem  satisfies 

rr+l  ^  £'+1  <  F 
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Moreover,  we  can  also  assert  that  x'+l  has  not  been  previously  generated.  For,  by  contradic¬ 
tion,  if  it  has  been  generated,  at  the  q'h  stage,  q  <  r,  then  let  the  corresponding  Benders’  cut 
bez  ^  ciq  x  —  aq.  Noting  through  Lemma  1  that  the  subproblem  yields  the  same  value  as  the 
quadratic  assignment  problem  for  any  given  solution,  we  get  z  ^  a'q  xr+]  -  aq  <  z'+1  which 
contradicts  zr+l  <  I. 

Thus,  so  long  as  an  approximate  solution  yields  a  value  for  the  master  problem  which  is 
strictly  less  than  that  of  the  current  best  solution  to  the  quadratic  assignment  problem,  includ¬ 
ing  the  value  of  the  new  point  generated,  we  may  use  this  approximate  solution  with  the 
assurance  that  it  has  not  previously  been  generated.  It  is  only  when  this  condition  does  not 
hold  that  we  need  to  solve  the  master  problem  exactly.  Before  discussing  the  proposed  approxi¬ 
mate  and  exact  solution  techniques  below,  we  emphasize  that  this  modification  still  maintains 
the  scheme  as  an  exact,  finitely  convergent  procedure. 


3.2.1  Scheme  for  Generating  an  Approximate  Solution  to  the  Master  Problem 


In  this  section,  we  will  develop  a  heuristic  to  obtain  a  good  quality  approximate  solution 
to  the  master  problem  MP(E,).  Towards  this  end,  suppose  we  compute 

/3,,  =  maximum  { 1 ,  Iminimumla^x  -  o,,)|)  for  each  E, 
and  formulate  the  problem 

MP{Er).  minimizelAr'jr:  x€  T4) 

where. 


(3.2) 


Ar> 


I 

/>-! 


Pp 


Consider  the  solution  obtained  through  the  linear  assignment  problem  MP(Er).  Since  A  'rx  is 
essentially  the  sum  of  the  terms  <x’px,  p€  £,,  each  normalized  by  the  magnitude  /3P,  Problem 
MP(Er)  tends  not  to  minimize  any  cut  expression  too  much  at  the  expense  of  increasing  the 
value  of  others.  In  other  words,  the  solution  to  MP(Er)  tends  to  achieve  the  criterion  of  Prob¬ 
lem  MP(Er),  viz,  to  minimize  the  maximum  of  several  cut  expressions.  This  is  further  sub¬ 
stantiated  by  the  fact  that  the  quantities  minimum  ( apx  -  a p:  x 6  XA]  are  almost  always  nega¬ 
tive,  with  the  result  that  (3.2)  gives  more  weight  to  cut  expressions  which  tend  to  have  algebra¬ 
ically  larger  values.  Thereby,  cuts  which  are  likely  to  be  binding  in  an  optimal  solution  to 
MP(Er)  are  given  more  weightage. 


As  an  additional  improvement  routine  for  this  scheme,  we  adopted  the  following  strategy 
which  enabled  us  to  recover  good  quality  solutions.  Consider  the  linear  bounding  form  g'x  of 
Cabot  and  Francis  13]  satisfying  g'x  4  x'  5*  for  each  x€  XA,  with  coefficients  given  by 


g„  =  minimum 


^  x  €  T .( ,  xlt 

/-I 


1 


Note  that  gn  is  obtained  by  solving  a  linear  assignment  problem  in  m— 1  variables  after  fixing 
the  assignment  xti  =  1.  Now,  let  . 


Vmin  =*  minimumlg'x:  x€  and  »<ma,  =  maximum]#'*:  *€  T.,). 

Working  with  some  test  problems  in  the  literature,  we  found  that  optimal  or  good  quality  solu¬ 
tions  often  had  values  of  g'x  lying  jn  the  initial  10-15%  of  the  range  [vmjn,  i>maJ.  Hence,  we 
found  it  advantageous  to  replace  MP(Er )  by  the  problem 
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minimize 

A/x  + 

w 

g'x:  x  €  X4 

••'min 

l 

where  w  is  a  suitable  weightage  parameter.  In  our  experience,  the  solution  jc,+1  to  problem 
(3.3)  almost  always  yielded  a  corresponding  value  of  zr+]  satisfying  f'  +  1  <  z,  except  towards 
the  end  of  the  procedure  when  an  exact  solution  of  the  master  problem  was  frequently 
required.  Moreover,  the  solution  x'+l  tended  to  be  of  a  good  quality. 


Finally,  we  note  that  each  time  an  approximate  solution  was  obtained  through  (3.3),  we 
attempted  to  improve  on  this  solution  through  pairwise  interchanges,  and  elected  to  use  the 
resulting  solution  in  case  it  also  qualified  as  an  approximate  solution.  Thus,  only  when  neither 
the  solution  to  (3.3),  nor  the  solution  obtained  through  pairwise  interchange  improvements  on 
it,  yielded  a  value  of  r  less  than  the  incumbent  value,  did  we  need  to  resort  to  an  exact  solution 
technique  for  the  master  problem. 


3.2.2  Exact  Solution  Method  for  the  Master  Problem 


We  now  consider  the  exact  solution  of  Problem  MP(Er).  For  this  purpose,  note  that  if 
we  consider  the  linear  relaxation  X  of  the  set  X4,  then  integral  solutions  feasible  to  X  are  in  a 
one-to-one  correspondence  with  the  points  in  X4.  In  other  words,  all  integral  solutions  in  A" are 
zero-one  assignment  solutions.  Thus,  we  first  attempted  the  use  of  Gomory's  [13]  dual  all¬ 
integer  cuts  to  obtain  an  optimal  solution  to  MP(Er).  However,  in  spite  of  incorporating  all  the 
rules  for  finite  convergence  (see  [13]  and  [7]),  we  experienced  the  same  problems  as  Trauth 
and  Woolsey  [29]  and  as  formally  predicted  by  Finkelstein  [5].  Namely  the  procedure  works 
very  well  in  some  instances,  but  is  quite  unpredictable  in  most  instances.  In  some  cases, 
several  thousand  dual-integer  cuts  were  unable  to  find  an  integer  primal  feasible  solution. 
Further,  in  this  process,  the  updated  coefficients  in  the  simplex  tableau  tended  to  blow-up  in 
magnitude. 


We  thus  resorted  to  Glover's  [12]  pseudo-primal-dual  procedure  which  iterates  between  a 
lexicographic  dual  feasible  stage  related  to  Gomory's  dual  all-integer  algorithm  [13]  and  a  dual 
infeasible  stage  related  to  Young’s  primal  all-integer  algorithm  [30],  As  reported  by  Glover, 
this  method  results  in  a  lexicographic  decrease  in  the  updated  solution  column  of  the  simplex 
tableau  on  two  successive  visits  to  the  dual  feasible  stage  by  an  amount  at  least  as  much  as  the 
decrease  which  would  be  obtained  through  two  successive  iterations  of  the  dual  simplex 
method.  Of  course,  if  we  have  dual  feasibility  and  the  condition  z  =  zr  of  Step  2  holds,  then 
we  may  terminate  without  regard  to  primal  feasibility.  The  revised-simplex  method  was  special¬ 
ized  and  rules  for  finite  convergence  (see  [12,  13  and  7])  were  also  incorporated.  Although 
this  technique  contained  the  magnitudes  of  the  tableau  coefficients,  it  also  experienced  conver¬ 
gence  problems. 


Finally,  we  adopted  an  implicit  enumeration  scheme  which  was  initialized  just  once  at  the 
first  visit  of  the  procedure  to  the  exact  solution  routine  for  the  master  problem,  and  was  simply 
updated  at  each  subsequent  visit.  The  updating  feature  permitted  us  to  delete  Benders'  cuts  for 
larger  sized  problems  since  it  ensured  that  at  least  fathomed  solutions  would  not  be  regenerated 
during  the  exact  solution  of  the  master  problem.  This  scheme  either  resulted  in  a  solution  with 
a  value  of  z  less  than  the  incumbent  value,  in  which  case  this  solution  was  adopted  as  our 
approximate  solution,  or  else  it  verified  optimality  by  solving  the  problem  to  termination.  We 
avoid  giving  details  here  of  this  procedure  in  view  of  our  experience  reported  in  the  following 
section.  The  interested  reader  may  refer  to  Sherali  [27],  however. 
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4.  COMPUTATIONAL  EXPERIENCE 

In  this  section,  we  report  computational  experience  using  test  problems  available  in  the 
literature  [4,  23,  29].  We  found  that  the  procedure  performed  satisfactorily  in  detecting 
optimal  solutions  early  on  in  the  search  process.  Even  in  cases  where  storage  limitations  forced 
a  premature  termination,  solutions  of  quality  often  belter  than  any  other  known  in  the  litera¬ 
ture  were  obtained  during  the  early  stages  of  the  search.  In  fact,  recently,  Burkard  and  Strat- 
mann  [2]  have  described  excellent  heuristics  that  significantly  improve  upon  all  the  best  known 
solution  values  of  problems  in  the  literature  We  were  able  to  still  improve  further  upon  the 
values  for  Nugent  et  al’s  (23]  problems  of  sizes  m  =  20  and  m  =  30  and  perform  at  least  as 
good  on  the  other  problems  attempted  by  Burkard  and  Stratmann.  We  also  obtained  a 
significant  improvement  over  the  best  known  solution  to  Elshafei’s  [4]  hospital  layout  problem. 

We  note  that  unlike  the  experience  of  Geoffrion  and  Graves  (10]  on  using  Benders’  parti¬ 
tioning  on  a  multicommodity  distribution  flow  problem,  our  problem  required  a  large  number 
of  Benders’  cuts.  The  reason  for  this,  as  conjectured  later  in  Section  5,  may  be  that  a  solution 
to  the  linear  relaxation  of  Problem  QAP  4  is  not  "close"  to  an  integral  solution.  In  fact,  when 
working  with  Nugent,  Vollmann  and  Ruml’s  [23]  test  problems,  we  found  that  the  procedure 
required  to  generate  close  to  m  !  cuts  in  order  to  verify  optimality,  even  when  the  starting 
solution  was  optimal.  Table  1  below  reports  this  experience.  We  remark  that  the  objective 
function  value  of  the  master  problem  increased  very  slowly  except  for  the  late  stages  of  the 
procedure  when  a  rapid  increase  was  obtained.  As  a  result,  when  the  problems  of  sizes  m  ^  8 
were  prematurely  terminated,  the  master  problem  value  was  as  yet  negative. 


TABLE  1 .  Exact  Implementation  of  Benders  ’  Scheme 


Problem 

- 

.. 

ml 

#  of  cuts 
generated 

Optimality  verified? 

epu  seconds 
execution  time(1) 

5 

113 

YES 

6.9 

Nugent,  Vollmann 

6 

■BltS 

690 

YES 

154.4 

and  Ruml’s 

7 

47 1 1 <2) 

NO 

Problems  [23] 

8 

ESS 

3722<2) 

NO 

770 

(II  On  a  CDC  Cyber  70.  Model  74-28/CDC  6400  machine,  will)  coding  in  Forlran  IV 
(2)  Forced  termination  as  lime  limit  is  reached 


In  view  of  these  results,  we  chose  to  operate  the  procedure  as  a  heuristic  by  terminating  it 
prematurely.  It  may  be  noted  that  the  performance  of  the  procedure  as  a  heuristic  with  regard 
to  the  quality  of  the  best  solution  obtained  is  sensitive  to  both  the  starting  solution  employed, 
as  well  as  the  value  of  w  in  Equation  (3.3).  In  most  cases,  we  found  it  appropriate  to  start  with 
a  value  of  tv  =  1  and  increment  it  by  one  every  three  to  ten  cuts.  For  constructing  a  starting 
solution,  we  adopted  the  following  strategy.  For  each  facility,  we  computed  the  sum  of  flows  to 
all  the  other  facilities  and  for  each  location,  we  computed  the  sum  of  distances  to  all  the  other 
locations.  We  then  arranged  the  facilities  in  nonincreasing  order  of  their  flow  sums  and  the 
locations  in  nondecreasing  order  of  their  distance  sums.  The  desired  solution  was  constructed 
by  matching  the  arrangement  of  facilities  and  locations  element  by  element,  and  then  improv¬ 
ing  upon  the  resulting  solution  through  pairwise  exchanges. 

Table  2  reports  our  computational  experience  with  some  problems  available  in  the  litera¬ 
ture  [4,  23,  28],  Column  ‘a’  gives  the  best  values  obtained  on  using  the  above  starting  solution 
and  terminating  the  process  after  25  cuts.  Column  ‘b’  gives  the  cut  index  at  which  this  best 
solution  was  found.  Through  a  few  subsequent  runs  using  better  quality  starting  solutions  as 
obtained  over  previous  runs,  we  were  able  to  improve  on  the  initial  run  solutions.  Column  'c' 
gives  the  best  objective  values  we  were  able  to  obtain  in  this  manner.  The  quality  of  these 
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TABLE  2.  Implementation  of  Benders'  Scheme  as  a  Heuristic 


Problem  m 

a 

B 

c 

B 

Best  Locations  of  Facilities  1,2,  ....  m 
Respectively  for  Solutions  of  Values  in 
Column  c 

e 

Nugent,  Vollmann  5 

and  Ruml's  6 

Problems  [23]  7 

8 
12 
15 
20 

30 

25 

43 

74 

107 

289 

575 

1285 

3095 

1 

25 

43 

74 

107 

289 

575 

1285 

3077 

25 

43 

74 

107 

289 

575 

1287 

3079 

3, 4,5, 1,2 

3,2,1,6,5,4 

1,2, 5, 3,4, 7,6 

3,4,8,2,1,5,6,7 

5,1,9,8,4,3,11,7,10,2,6,12 

5,4,9,10,12,13,7,2,1,15,6,11,3,8,14 

4,12,19,11,2,5,3,9,20,18,14,13,10,17,7,15,1, 

16,6,8 

20,29,11,5,30,28,15,9,23,21,2,27,22,6,1,10, 

19,8,16,12,24,14,7,26,13,25,3,18,17,4 

0.69 

1.30 

2.25 

3.56 

13.69 

33.93 

100.61 

393.96 

Rectilinear 
Distance  36 

Stein¬ 

berg’s 

4975 

10 

4802 

4802 

14,27,35,25,34,33,15,26,7,16,32,23,24,31,6,8, 

18,17,12,22,20,29,21,28,19,10,30,13,5,4,3,11, 

1,2,9,36 

829.48 

(281  Squared 

Euclidean  36 
Distance 

8232 

16 

7926 

7926 

31,10,2,12,3,4,22,11,29,21,5,14,13,15,30,28, 

19,20,24,23,7,8,16,9,17,18,6,32,33,34,35,25, 

27,26,36,1 

921.02 

Euclidean 
Distance  36 

»; 

I 

1 

B 

Same  as  for  the  squared  euclidean  distance 
solution,  but  with  facilities  1  and  15 
interchanged  in  location. 

958.6 

Elshafei’s  Problem 

(41  19 

ci 

12 

C| 

■ 

17,18,19,11,12,9,3,14,1,2,10,13,7,5,16,15,8,4,6 

96.15 

(1)  This  resulted  through  the  use  of  the  best  recorded  squared  euclidean  solution  as  a  starting  solution. 


a,  b,  c,  d,  e  —  See  the  text  of  Section  4  for  connotation. 

c  |  -  8,  606,  274 
c2  -  11,  281,  888 
c3  -  4125.168 


solutions  may  be  compared  with  the  previously  best  known  solutions  reported  in  the  literature 
[21  as  given  in  column  ‘d’.  Note  that  one  may  use  such  trial  runs  to  select  between  the  two 
subproblem  solutions  given  in  Section  3.1,  preferring  the  one  which  yields  larger  values  for  the 
master  problem,  and  one  may  also  derive  information  on  the  manner  in  which  tv  may  be  incre¬ 
mented  in  Equation  (3.3).  Finally,  column  ‘e'  gives  the  execution  time  in  cpu  seconds  for  a 
run  which  generates  25  cuts  on  a  CDC  Cyber  70  Model  74-28/CD C  6400  computer,  with  cod¬ 
ing  in  FORTRAN  IV.  These  times  do  not  include  the  effort  for  generating  either  the  linear 
bounding  form  g'x  of  Section  3.2.1  or  the  starting  solution. 


Before  concluding,  we  note  that  further  improvements  of  the  procedure  as  a  heuristic  may 
be  possible  through  the  implementation  of  more  sophisticated  exchange  schemes  [24]  in  lieu  of 
the  simple  pairwise  exchange  operations  we  have  employed  at  each  iteration.  Also,  one  may 
choose  to  attempt  a  few  short  trial  runs  with  the  aim  of  gaining  the  type  of  information  alluded 
to  above,  and  then  execute  a  final  run  using  the  best  solution  found  in  the  trial  runs  as  a  start¬ 
ing  solution. 
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5.  CONCLUDING  REMARKS 

li  is  our  purpose  in  this  section  to  discuss,  at  least  for  our  problem,  the  relationship 
between  the  performance  of  Benders'  partitioning  scheme  in  regard  to  the  number  of  cuts 
required  and  the  closeness  of  the  solution  of  the  original  problem  to  that  of  the  linearly  relaxed 
mixed  integer  program.  In  fact,  we  show  below  that  if  the  linear  relaxation  of  Problem  QAP  4 
has  an  integral  optimal  solution,  then  Benders’  procedure  terminates  as  soon  as  this  solution  is 
detected. 


Suppose  that  an  optimal  integer  solution  (x,  y)  to  the  linear  relaxation  of  Problem  QAP  4 
exists  so  that  x  solves  the  quadratic  assignment  problem.  The  Kuhn-Tucker  conditions  can  be 
shown  to  assert  the  following,  where  u,r  v;;,  w,M/  are  the  dual  variables  associated  with  con¬ 
straints  (2.2),  (2.3),  and  (2.7)  respectively: 

y,iki  -  0  —  w,Jkt  »  0  and  u„  +  vA/  <  c„kt 
<s  ^  y.iki  “  1  —  «,/  +v*,  -  wljkl  =  <■„*/,  wljkl  >  0 

for  /  —  1 . m— 1,  J  —  1 ,  ....  m,  k  =  /  +  1 . m,  I  —  1 ,  ....  m.  I  ^  j 


Suppose  that  x  is  generated  by  Benders'  scheme,  say  as  a  solution  to  the  Master  Problem 
MP{Er. 1).  Since  x  solves  the  quadratic  assignment  problem,  and  since  by  Lemma  1, 
y,iki  “  x„xA/,  then  the  current  incumbent  value  z  is  given  by:  ' 

(5.2)  -  "U1  i  "  A 

+  2.  I,  L  L  tyw  y>ikh 

!-\  /-I  A— 1  +  1  l-l 
l*j 

Note,  however,  that  (5.1)  implies  that  (w,  v,w)  is  indeed  an  optimal  solution  to  problem  SP(x). 
By  examining  the  dual  of  problem  SP(x),  it  immediately  follows  that  the  right  hand  side  of 

m  m 

(5.2)  is  equal  to  £  £  a,,  xu  —  a,  where, 

.-i  /-i 


yielding 

(5.3) 


a„  —  (m  —  i)  «„  +  (/  -  1)  v„,  and 

m—  1  m  m  m 

a  =  Z  Z  £  Z  »uki 

/-I  7-1  *-1+1  /-I 
l*j 


m  m 

2  “II  On  x,;  -  n 

>-l  7-1 


The  next  Benders’  cut  generated  is  of  the  form  r  ^  £  £  a,,  xtj  -  a,  and  hence  an 

l-l  7-1 

optimal  objective  value  zr+l  to  problem  MP(E,)  must  satisfy  the  following  inequality: 

(5.4) 


m  m 

zr"  >  L  Z  a'i  -  « 

<-i  /-i 


In  view  of  (5.3)  and  (5.4),  we  have  z,+l  >  z,  which  is  the  termination  criterion  for  Benders’ 
scheme. 


To  summarize,  if  a  solution  (x,  y)  to  the  linear  relaxation  of  problem  QAP  4  is  integer, 
and  if  xis  generated  at  some  iteration  r  -  1,  then  Benders’  scheme  will  terminate  at  iteration  r 
with  the  conclusion  that  x  is  optimal.  In  particular,  if  x  is  used  as  a  starting  solution,  then  only 
a  single  cut  is  sufficient  to  verify  optimality.  We  emphasize,  however,  according  to  our  compu¬ 
tational  testing,  that  if  the  solution  to  the  relaxed  problem  is  not  integer,  even  if  we  start  with 
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the  optimal  solution  to  the  quadratic  assignment  problem,  many  Benders’  cuts  may  be  needed 
to  verify  optimality.  We  hence  conjecture  that  a  fewer  number  of  Benders’  cuts  may  be 
required  if  the  solution  of  the  linear  relaxation  of  QAP  4  is  close  to  an  integral  solution.  For 
additional  insight  into  Benders'  partitioning  scheme  and  in  particular  to  the  strength  of  bounds 
derived  therefrom,  we  refer  the  reader  to  Rardin  and  Unger  (261. 
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ABSTRACT 

Periodic  mass  screening  is  Ihe  scheduled  application  of  a  test  to  all  members 
of  a  population  to  provide  early  detection  of  a  randomly  occurring  defect  or 
disease  This  paper  considers  periodic  mass  screening  with  particular  reference 
to  Ihe  imperfect  capacity  of  the  test  to  delect  an  existing  defect  and  the  associ¬ 
ated  problem  of  selecting  the  kind  of  test  to  use.  Alternative  kinds  of  tests 
differ  with  respect  to  their  reliability  characteristics  and  their  cost  per  applica¬ 
tion. 

Two  kinds  of  imperfect  test  reliability  are  considered.  In  the  first  ease,  the 
probability  that  the  test  will  delect  an  existing  defect  is  constant  over  all  values 
of  elapsed  lime  since  the  incidence  of  Ihe  defect.  In  the  second  case,  Ihe  test 
will  delect  the  delect  if,  and  only  if,  the  lapsed  lime  since  incidence  exceeds  a 
critical  threshold  T  which  characterizes  the  test. 

The  cost  of  delayed  detection  is  an  arbitrary  increasing  function  (the  "dis¬ 
utility  function")  of  Ihe  duration  of  the  delay.  Expressions  for  the  long-run  ex¬ 
pected  disutility  per  unit  time  are  derived  for  the  above  two  cases  along  with 
results  concerning  Ihe  best  choice  of  type  of  test  (where  the  decision  rules 
make  reference  to  characteristics  of  the  disutility  function). 


INTRODUCTION 

Mass  screening  is  the  process  of  inspecting  all  members  of  a  large  population  for  defects. 
If  the  early  detection  of  a  defect  provides  benefits,  it  may  be  advantageous  to  employ  a  test 
capable  of  revealing  the  defect's  existence  in  its  earlier  stages.  (Throughout  this  paper,  the 
words  "defect"  and  "unit"  or  "individual"  will  refer  to  defect,  disorder,  or  disease  and  to  a 
member  of  the  population,  respectively). 

Defects  may  arrive  in  a  seemingly  random  fashion  such  as  many  types  of  machine  failure, 
the  incidence  of  certain  types  of  cancer,  diabetes,  glaucoma,  heart  disease,  etc.;  or  they  may 
arrive  as  the  result  of  some  contagion  such  as  smallpox,  polio,  etc.  It  is  the  former  type  of 
arrival  process,  random  and  independent  arrivals,  which  is  studied  in  this  paper. 


•This  research  was  supported  by  the  Office  of  Naval  Research  under  Contracts  N000I4-67-0356-0030  and  NOOOI4-75- 
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Continuous  monitoring  would  provide  the  most  immediate  detection  of  a  defect,  but  con¬ 
siderations  of  expense  and  practicality  will  frequently  rule  out  continuous  monitoring  so  that  a 
schedule  of  periodic  testing— a  periodic  screening  program  — may  be  the  most  practical  means  of 
achieving  early  detection  of  the  defect.  In  general  terms,  the  question  then  becomes  one  of 
how  best  to  trade  off  the  expense  of  testing  which  increases  both  with  the  frequency  of  test 
applications  and  with  the  cost  of  the  type  of  test  used  against  the  benefits  to  be  achieved  from 
detecting  the  defect  in  an  earlier  state  of  development. 

The  benefits  of  early  detection  depend  upon  the  application  considered.  For  example,  in  a 
human  population  being  screened  for  some  chronic  disease,  the  benefits  of  early  detection 
might  include  an  improved  probability  of  ultimate  cure,  diminished  time  period  of  disability, 
discomfort,  and  loss  of  earnings;  and  reduced  treatment  costs.  If  the  population  being  screened 
consists  of  machines  engaged  in  some  kind  of  production,  the  benefits  of  early  detection  might 
include  a  less  costly  ultimate  repair  and  a  reduction  in  the  time  period  during  which  a  faulty 
product  is  being  unknowingly  produced.  If  the  population  being  screened  consists  of  machines 
held  in  readiness  to  meet  some  emergency  situation,  an  early  detection  of  a  defect  would 
reduce  the  time  the  machine  was  not  serving  its  protective  function. 

The  expense  of  testing  includes  easily  quantifiable  economic  costs  such  as  those  of  the 
labor  and  materials  needed  to  administer  the  testing.  However,  there  can  also  be  other  impor¬ 
tant  cost  components  which  are  more  difficult  to  quantify.  For  example,  in  the  case  of  a  human 
population  subject  to  medical  screening,  the  cost  of  testing  includes  the  inconvenience  and  pos¬ 
sible  discomfort  necessitated  by  the  test,  the  cost  of  false  positives  which  entails  both  emotional 
distress  and  the  need  to  do  unnecessary  follow-up  testing,  and  even  the  risk  of  physical  harm  to 
the  testee;  e  g.,  from  the  cumulative  effect  of  x-ray  exposure. 

The  design  of  a  mass  screening  program  must  address  two  important  questions.  How  fre¬ 
quently  to  test  and  what  kind  of  test  to  use.  Optimal  testing  frequency  has  been  investigated  as 
a  function  of  the  defect  incidence  rate  and  other  factors  by  Derman  [3],  Roeloffs  (8,9],  Barlow, 
et  al.  1 1 J ,  Keller  (4],  Kirch  and  Klein  [5],  and  Lincoln  and  Weiss  (6]. 

The  second  question  follows  from  the  fact  that  more  than  one  kind  of  test  may  be  avail¬ 
able  for  use  in  a  mass  screening  situation.  The  alternate  tests  can  be  entirely  different  pro¬ 
cedures;  or  they  can  be  the  same  procedure  with  different  criteria  for  what  constitutes  a  positive 
outcome,  e.g.,  alternate  levels  at  which  a  recording  of  systolic  blood  pressure  would  induce 
follow-up  testing. 


Alternative  tests  will  generally  differ  both  in  their  reliability  characteristics  and  in  their 
cost  of  application.  How  to  select  which  test  to  use  is  a  question  which,  to  our  knowledge,  has 
not  previously  been  examined  in  the  context  of  a  general  model.  This  paper  will  examine  this 
question.  For  two  different  ways  of  modeling  test  reliability,  we  develop  a  framework  for  test 
selection  and  present  certain  specific  decision  rules. 


Test  reliability  is  assumed  to  be  a  function,  p(t),  of  elapsed  time,  /,  since  defect 
jl  if  /  €  S 

|0  otherwise  UsualIy  s  wil1  be  an  interval,  e.g.,  (r,°°).  The  two 


incidence.  Define  5S(/) 


cases  pit)  “  p  and  p(t)  =  S( r  „)(,()  are  considered.  p(l)  =  p  indicates  that  test  reliability  is 
independent  of  defect  age.  For  p(t)  =  6(r «,,(/),  the  test  will  detect  a  defect  if,  and  only  if, 
the  defect  has  existed  for  at  least  T  units  of  time.  In  a  sense,  these  two  classes  of  p(i) 
represent  polar  extremes  in  the  responsiveness  of  test  reliability  to  defect  maturity.  The  test 
choice  decision  is  posed  within  each  of  these  two  reliability  classes. 
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A  crucial  feature  in  any  optimization  model  of  mass  screening  is  the  characterization  of 
the  cost  due  to  detection  delay.  Detection  delay  is  the  gap  between  the  time  of  detection  and 
the  time  of  defect  incidence  (or  the  time  the  defect  becomes  potentially  detectable  by  a  screen¬ 
ing  test).  The  mapping  between  detection  delay  and  the  resultant  cost  we  call  the  disutility 
function,  O(-).  Obviously,  the  shape  of  the  disutility  depends  upon  the  particular  application 
considered.  Section  1  gives  examples.  1 

Pierskalla  and  Voelker  [7]  demonstrated-  that  the  shape  of  D()  impacts  the  optimal  alloca¬ 
tion  of  a  screening  budget  among  segments  of  the  client  population  characterized  by  differing 
defect  incidence  rates.  Results  in  Sections  3  and  4  below  demonstrate  the  role  D()  plays  in 
test  choice  decision  rules. 

Previous  research  in  the  area  of  optimal  mass  screening  which  utilized  closed  form 
expressions  for  expected  cost  (or  disutility)  placed  restrictive  assumptions  upon  the  shape  of 
£>(■).  Early  work  assumed  DO)  =  ct  (Barlow,  et  al.  [1]).  In  Kirch  and  Klein  [5], 
DO)  =  min  (t,T)  where  r  is  detection  delay  and  T  is  the  (possibly  random)  delay  between 
defect  incidence  and  the  time  when  the  defect  would  be  discovered  in  the  absence  of  a  screen¬ 
ing  program.  Keller  (4)  restricts  the  generality  of  D()  by  requiring  that  DO)  and  the  test  fre- 

J.l/i-0) 

o  D(s)dsbe  well  approximated  by  D(l/2rO)).  (A 
density  is  employed  to  represent  the  schedule  of  test  times  so  that  the  calculus  of  variations 
could  serve  as  the  optimization  tool.) 

Lincoln  and  Weiss  16}  derive  two  kinds  of  optimal  testing  schedules.  Both  schedules 
maximize  the  time  between  successive  tests  subject  to,  respectively,  a  bound  on  the  mean 
detection  delay  and  a  bound  on  the  probability  that  detection  delay  will  exceed  a  fixed  thres¬ 
hold.  Neither  version  is  equivalent  to  using  a  general  disutility  function. 

A  few  authors,  Schwartz  and  Galliher  [10},  Thompson  and  Disney  (Ill,  and  Voelker  [121 
let  both  the  reliability  of  the  test  and  the  disutility  (or  utility)  of  detection  be  a  function  of  the 
defect’s  state  rather  than  of  time  since  the  defect’s  incidence.  Although  such  models  are  more 
general  and  do  utilize  a  general  concept  of  disutility,  they  have  not  been  amenable  to  closed 
form  evaluation  of  expected  disutility. 

To  incorporate  random  defect  arrivals  into  their  models,  previous  researchers  focus  upon 
an  individual  who  will  incur  the  defect.  They  use  the  density  function  for  the  age  when  that 
individual  incurs  the  defect  as  a  fundamental  element  of  their  model.  Since  the  density  func¬ 
tion  reflects  age-specific  incidence  rates,  a  "life  time”  testing  schedule  can,  thereby,  be 
developed  to  tailor  testing  frequency  at  each  age  to  the  probability  that  the  defect  will  occur  at 
that  age. 

Our  way  of  modeling  the  randomness  of  defect  arrivals  reflects  a  somewhat  different  per¬ 
spective  on  the  mass  screening  problem.  We  look  through  the  eyes  of  a  decision  maker 
charged  with  intelligently  allocating  a  fixed  budget.  The  time  frame  over  which  the  allocation 
must  be  made  is  often  short  compared  to  a  typical  life  time  of  a  member  of  the  client  popula¬ 
tion.  Therefore,  the  decision  maker  does  not  plan  lifetime  screening  schedules  for  particular 
individuals.  Instead,  he  tries  to  maximize  the  benefit  that  can  be  derived  from  his  available 
budget  over  a  much  shorter  planning  horizon.  (For  modeling  purposes  the  objective  of  minim¬ 
izing  expected  long  run  cost  per  unit  time  is  not  unreasonable  with  the  problem  viewed  in  this 
way  since  steady  state  conditions  should  approximately  obtain  after  the  initial  screening.  This  is 
especially  the  case  when  an  existing  and  ongoing  screening  program  is  being  optimized  by  the 
decision  maker.  Also,  lacking  information  to  the  contrary,  the  decision  maker  has  no  reason  to 
anticipate  abrupt  changes  in  the  screening  policy  at  the  end  of  the  planning  horizon.) 
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With  the  problem  viewed  in  this  perspective,  the  random  nature  of  defect  at  rivals  is  most 
naturally  modeled  as  a  Poisson  process  with  its  parameter  determined  by  the  incidence  rate  of 
the  defect  and  the  size  of  the  population.  This  approach  has  proved  particularly  useful  in  the 
following  context:  If  different  segments  of  the  client  population  exhibit  different  incidence 
rates,  subpopulations  can  be  defined  with  defect  incidence  within  each  modeled  as  a  Poisson 
process  with  its  respective  parameter.  Then  the  budget  can  be  so  allocated  among  the  subpopu¬ 
lation  to  permit  appropriate  relative  testing  frequencies  (cf.  Pierskalla  and  Voelker  (7)).  In  this 
way,  age-specific  incidence  rates  can  be  incorporated  into  the  notion  of  Poisson  defect  arrivals. 
Moreover,  factors  other  than  age  which  affect  defect  incidence  rates  (family  history,  smoking 
habits,  work  environment,  etc.)  can  also  be  incorporated  into  the  model. 

Although  this  paper  does  not  follow  Pierskalla  and  Voelker  [7]  in  studying  the  case  of 
heterogeneous  populations,  we  use  the  same  Poisson  model  of  random  defect  arrivals.  Section 
1  presents  examples  of  disutility  functions.  Section  2  sketches  the  basic  structure  of  the  model 
and  represents  the  expected  long  run  disutility  per  unit  time  for  general  reliability  and  disutility 
functions.  Section  3  considers  the  class  of  tests  with  pit)  =  p.  Section  4  does  so  for  pit)  = 
8 1 r. ■» )  ( / ) •  In  both  Sections  3  and  4,  results  regarding  and  test  choice  criteria  are  presented. 
Proofs  are  deferred  to  the  APPENDIX. 

SECTION  1:  SOME  EXAMPLES  OF  DISUTILITY  FUNCTIONS 

Suppose  a  production  process  is  subject  to  a  randomly  occurring  defect.  Although  produc¬ 
tion  appears  to  proceed  normally  after  the  incidence  of  the  defect,  the  product  produced  is, 
thereafter,  defective  to  an  extent  which  remains  constant  until  the  production  process  is 
returned  to  its  proper  mode  of  operation.  The  only  way  to  learn  if  the  production  process  is  in 
this  degraded  state  is  to  perform  a  costly  test.  Now,  if  a  test  detects  the  existence  of  the 
degraded  mode  of  production  t  units  of  time  after  its  incidence,  the  harm  done  will  be  propor¬ 
tional  to  the  amount  of  defective  product  (unknowingly)  produced  which,  in  turn,  is  propor¬ 
tional  to  t.  Hence,  Dit)  =  at  for  some  a  >  0. 

Another  example  where  a  linear  £>(•)  function  may  be  appropriate  would  be  for  the 
periodic  inspection  of  an  inactive  device  (such  as  a  missile)  stored  for  possible  use  in  an  emer¬ 
gency.  If  t  is  the  time  between  the  incidence  of  the  disorder  and  its  detection,  the  disutility 
incurred  is  proportional  to  the  probability  that  the  device  would  be  needed  in  that  time  interval. 
If  such  "emergencies"  arise  according  to  a  Poisson  process  with  rate  p,  then  the  probability  of 
an  emergency  in  a  time  interval  of  length  t  is  1  -  e-'1',  which,  for  p  small,  is  approximately  pt. 
Hence,  if  b  is  the  cost  incurred  should  there  be  an  emergency  while  the  device  is  defective,  and 
if  p  is  the  (small)  arrival  rate  of  emergencies,  then  Dit)  —  bpt. 

A  quadratic  disutility  could  arise  in  the  following  situation.  Suppose  the  magnitude  of  a 
randomly  occurring  defect  increases  linearly  with  time  since  the  occurrence  of  the  defect.  For 
example,  the  magnitude  of  the  defect  might  be  the  size  of  a  small  leak  in  a  storage  container 
for  a  fluid,  and  as  fluid  escapes,  the  leak  gets  larger.  Further  suppose  that  the  harm  done  accu¬ 
mulates  at  a  rate  proportional  to  the  magnitude  of  the  defect.  Hence,  the  quantity  of  fluid  lost 
(at  least  initially)  increases  the  longer  the  defect  exists,  and  the  rate  of  fluid  loss  is  proportional 
to  the  size  of  the  leak. 

Let  the  size  of  the  leak  (as  measured  by  rate  of  fluid  loss),  at  time  s  since  the  leak’s 
incidence,  be  cs.  Then,  if  the  defect  is  detected  at  time  t  since  incidence,  the  disutility  incurred 

(fluid  lost)  is  Dit)  —  f  csds  -  l/2(cf)3. 
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SECTION  2:  EXPECTED  DISUTILITY  WITH  A  GENERAL  RELIABILITY  FUNCTION 

We  assume  that  the  times  of  defect  arrivals  in  the  screened  population  form  a  stationary 
Poisson  process.  Since  there  is  a  certain  intuitive  appeal  to  considering  the  defect  arrival  rate 
proportional  to  both  the  size  of  the  population,  N,  and  the  intrinsic  incidence  rate.  A;  let  Nh 
designate  the  parameter  of  the  above  arrival  process.  It  is  not  necessary  to  know  the  value  of 
A/\  in  order  to  apply  the  decision  rules  of  test  selection  developed  in  this  paper. 

Let  p/(t)  be  the  probability  that  a  test ^>f  type  /  will  detect  a  defect  which  has  been  present 
i  units  of  time.  p,U)  =  0  for  /  <  0.  Let  S*,  be  a  random  variable  denoting  the  time  at  which 
the  k'h  defect  is  detected.  Sk,  depends  upon  the  arrival  time  of  the  defect,  8^;  the  type  of  test 
used,  /;  and  the  testing  frequency,  r. 

Given  the  application  of  test  type  /  at  the  times  {l/r,2/r,  ...|,  the  disutility  incurred  by 
the  Alh  defect  is  D(Sk}  —  Sk).  The  total  disutility  incurred  due  to  defects  which  occur  before 
time  A  is 

£  D(S*;  -  Sk)  h[OM(Sk). 

A- I 


In  Pierskalla  and  Voelker  [7],  the  long  run  expected  disutility  per  unit  time  under  the 
above  screening  program  r,p(  ))  was  shown  to  be 


(2.1) 


00  /*  n/r  — J 

rNk  T  I  D(u)  p,(u)  FT  ll  —  Pi(u  -  m/r )) 

n-\  J'-U'  m-1 


du. 


This  result  will  serve  as  our  starting  piont  for  the  technical  results  of  this  paper.  Lincoln 
and  Weiss  established  essentially  the  same  result  based  upon  a  different  probabilistic  model  of 
defect  arrival. 

SECTION  3:  CONSTANT  TEST  RELIABILITY 

The  mass  screening  model  yields  interesting  results  for  a  test  which  has  a  fixed  probability 
p  of  detecting  the  disorder  if  it  is  present  in  an  individual.  Such  a  model  would  arise  if  the 
unreliability  of  the  test  is  entirely  intrinsic  to  the  test  procedure  rather  than  partially  dependent 
upon  the  state  or  age  of  the  defect.  An  example  of  this  is  the  administration  of  a  Mantoux  test 
for  tuberculosis  in,  say,  a  population  of  grade  school  children.  The  test  has  a  small  but  rela¬ 
tively  constant  level  of  false  negatives.  There  are  other  medical  tests  with  similar  characteris¬ 
tics. 

A  Quality  Control  Example 

To  see  how  another  type  of  situation  with  constant  test  reliability  could  arise,  consider  a 
production  system  which  is  subject  to  a  randomly  occurring  defect  which  degrades  the  system's 
performance.  Once  the  defect  occurs,  the  level  of  degradation  of  the  process  remains  constant 
until  the  defect  is  discovered.  Suppose  the  defect  is  such  that  each  item  produced  has  probabil¬ 
ity  8  of  being  defective  and  that  the  system  without  the  defect  never  produces  defective  items. 


48 


J.A.  VOELKER  AND  W.P.  P1ERSKALLA 


The  only  way  to  discover  the  existence  of  the  defect  in  the  production  system  is  to  exam¬ 
ine  an  item  produced  which  is  itself  defective.  Now,  if  the  examination  of  an  item  is  expensive 
(e.g.,  the  item  is  destroyed  as  a  result  of  the  inspection)  and  if  the  capacity  to  examine  a 
sequence  of  items  involves  a  set-up  cost  (say  a),  the  following  strategy  might  be  called  for:  At 
specified  times  \/r,  2/r,  ....  set  up  the  capacity  to  examine  a  sequence  of  items  and  examine, 

say,  /  items  at  each  of  those  times.  The  times  l/r,  2/r . are  then  the  times  of  testing  and 

sample  size  /  specifies  the  test  type. 

Assume  that  if  a  defective  item  is  examined,  the  defect  is  always  observed  and  the  pro¬ 
duction  process  is,  thereby,  discovered  to  be  in  the  degraded  state.  Hence,  a  degraded  state  of 
the  production  process  will  go  undetected  at  the  testing  occasion  k/r  if,  and  only  if,  each  of  the 
/  items  sampled  at  time  k/r  is,  by  chance,  not  defective.  But  the  probability  of  that  event  is 
(1  -  8)'.  Note  that  the  elapsed  time  t  between  the  entry  of  the  production  process  into  the 
degraded  state  and  the  test  time  k/r  does  not  affect  this  probability.  Hence,  Pi  =  P/(t)  =  1  — 
(1  —8)'  which  represents  the  probability  that  a  test  (the  inspection  of  /  items)  will  detect  a 
degraded  production  process.  Note  that  the  choice  of  /  affects  both  the  test’s  reliability  and 
cost. 

Expected  Disutility 

Designate  the  expected  long  run  disutility  per  unit  time  per  member  of  a  population 
screened  with  frequency  r  using  a  test  of  constant  reliability  (p(D  =  p,  t  >  0)  by  C(r,p). 
From  Eq.  (2.1), 

00  /•tn  +  D/r 

C(r,p )  =«  N\rp  £  (1  -  p)"  l  D(u)du. 

J«!r 


The  question  examined  next  is  how  do  changes  in  rand  p  affect  C(r,p).  After  that,  expli¬ 
cit  solutions  are  given  for  C{r,p)  when  D(  )  takes  certain  simple  forms.  And  lastly,  some  gen¬ 
eral  rules  are  indicated  for  selecting  between  a  particular  kind  of  test  and  a  more  expensive  but 
more  reliable  alternative  test  when  D(  )  takes  certain  forms. 

PROPOSITION  1:  If  D(-)  is  a  strictly  increasing  function,  ldC(r,p)]/dp  <  0  and 
laC(r,p)l/8r  <  0. 

Note  that  for  £>(■)  nondecreasing,  the  above  inequalities  still  hold,  but  not  strictly. 

From  this  proposition,  as  anticipated,  when  p  increases,  the  expected  disutility  decreases. 
Similarly  as  r  increases,  the  interval  l/r  between  tests  decreases  and  the  expected  disutility 
decreases.  Consequently,  as  better  test  types  are  used  or  the  tests  are  more  frequently  applied, 
the  value  of  such  changes  in  terms  of  reduced  disutility  versus  the  costs  of  the  changes  can,  in 
principle,  be  assessed  and  the  tradeoffs  evaluated. 


It  is  easy  to  compute  the  Hessian  for  C(r,p)  when  £>(•)  is  differentiable 
d2C 


dr2 
d  2C 


Nkp  £  q"r3 

n— 0 


{n  +  l)2  D‘ 

(»  +  1) 

-  n 2  D 

n 

r 

r 

NAr  £  n(n  -  1)  qn~2  l^(n.r)  -  W(n  -  l.r)]. 

Vr  n-2 


d2C 


a  a  /VX  £  (1  -  (n  +  l)p)  q"  1  I tV(n.r)  -  B(n  +  l.r)  +  fl(«,r)l, 
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where 

/•(fl  +  l)/r 

D(s)ds,  and  B(n,r)  —  —  D(n/r). 

n/r  r 

Note  that  if  D'(  )  is  increasing,  then  (8 2C/dr2)  >  0  and  (8 2C/dp2)  ^  0.  Hence,  along 
coordinate  directions  CU,p)  is  convex. 

Simple  expressions  for  C(r,p)  can  be  given  when  D(-)  is  specialized  to  a  polynomial  or  an 
exponential  function.  Since  these  two  types  of  functions  are  reasonably  general,  they  can  be 
quite  useful  as  realistic  approximations  in  applications. 

PROPOSITION  2:  If  DU)  -  £  a,  r"\  then 
/3  j)  CU.p)  -  Nkp  £  +  1)]  £  n"'+'pq"~\ 


If  the  m„  i  =  1 . N  are  positive  integers,  the  inner  summation  of  Eq.  (3.1)  is  simply 

the  (m,  +  1)  moment  of  a  geometric  random  variable.  Hence,  using  Laplace  transforms,  Eq. 
(3.1)  becomes 

(3  2)  C(r,p)  -  N\p  £  la,/rm'(m,  +  1)1  (0 \p) 

where  <li(t:p )  =  pe'/(  1  -  qe')  and  (m)(0:p)  =  dm  >bU:p)/dtm  |,_0.  For  example,  when  m  =  1, 
C(r,p)  =  (aN\/2r)[(2  -  p)/p]  and  for  m  »  2,  C(r,p)  =  [(aNK/ir2)]  (1  +  (6  -  6 p)/p2). 

PROPOSITION  3:  If  DU)  =  Pea'  for  a,  (i  >  0,  then  CU.p)  =  &N\pr(ea/r  -  1)/ 
a{\  -  qea,r),  for  r  >  -fl/log(</). 

Test  Selection 


Propositions  2  and  3  can  provide  a  means  to  select  between  two  alternative  kinds  of  tests 
which  differ  with  respect  to  reliability  of  detection  and  cost  per  application.  Let  test  No.  1  have 
cost  per  application  C|  and  reliability  p\.  The  corresponding  parameters  for  test  No.  2  are  ci 
and  p2.  If  test  No.  1  is  administered  with  frequency  rc2/c\  and  test  No.  2  administered  with 
frequency  r,  both  testing  regimes  will  consume  equal  quantities  of  the  budgeted  resource;  vis., 
Nrc2  per  unit  time.  If  C (rc2/ c^,p{)  ^  C(r,p2)  for  all  r  ^  0,  then  the  expected  disutility  per 
unit  time  will  be  less  with  test  No.  1  at  all  levels  of  budget  Nrc2.  That  is,  if  test  No.  2  is  being 
used  with  frequency  r,  the  expected  disutility  can  be  decreased  without  any  additional  allotment 
of  budget,  simply  by  switching  to  test  No.  1  and  testing  as  frequently  as  the  budget  permits. 


Suppose,  for  example,  that  DU)  =  atm  for  m  a  positive  integer.  Then  CTrc^/c,,/?,)  < 
C ( r,p2)  is  equivalent  by  Eq.  (3.2)  to 

(„/„)-  -  « ^ 


(3.3) 


P  i  </» 


(m  +  l) 


(0:p,) 


Therefore,  test  No.  1  is  preferred  over  test  No.  2  if,  and  only  if,  Eq.  (3.3)  obtains. 


50 


J  A  VOELKER  AND  W  P  PIERSKALLA 


SECTION  4:  THRESHOLD  TEST  RELIABILITY 

In  the  previous  section ,  the  reliability  of  the  test  depended  only  on  factors  intrinsic  to  the 
test  itself  and  did  not  depend  at  all  on  the  elapsed  time  since  incidence  at  the  time  of  the  test. 
In  this  section,  a  special  form  for  p(t)  is  considered  which  is  very  different  from  the  case  of 
constant  test  reliability.  Here  the  test  reliability  is  zero  if  the  elapsed  time  since  the  defect’s 
incidence  is  less  than  T ;  otherwise,  the  reliability  is  one.  That  is,  pit)  =  S ( r  «, , ( / )  where  the 
number  7"  is  a  characteristic  of  the  type  of  test  chosen. 

For  a  screening  program  in  which  a  test  with  the  above  reliability  characteristics  is  applied 
with  frequency  r,  let  A  (r.T )  represent  the  expected  long  run  disutility  per  unit  time. 

Of  course,  the  "blind  period"  of  the  test  for  0  to  T does  not,  in  a  mass  screening  situation, 
delay  detection  of  each  arriving  defect  exactly  T  units  of  time.  The  amount  of  delay  depends 
on  the  interplay  among  the  time  of  arrival  of  the  defect,  the  testing  schedules  [Mr.  2/r,  . . . ), 
and  the  magnitude  of  T. 

The  primary  results  in  this  section  are  a  simple  characterization  of  A  (r.T)  and  rules 
which,  in  some  cases,  will  permit  selection  betwseen  two  tests  which  differ  in  their  reliability 
(i.e.,  in  their  detection  threshold  T)  and  in  their  cost  per  application. 

Long  Run  Disutility 

PROPOSITION  4:  If  p()  =  8|roo)(  )  for  some  T  ^  0,  then 

A  (r.T)  —  A'Ar  f  D(T  +  u)  da. 

Jo 

Suppose,  for  example,  that  D(t)  =  exp(ar)  for  a  >  0.  Then,  A(r,T)  = 
N\r  exp(flD[exp(a/ r)  -  ll/a. 

Test  Selection 

Suppose  the  decision  maker  has  two  kinds  of  tests  available  and  he  must  choose  one  of 
them  for  implementation  in  a  mass  screening  program.  Suppose  the  first  kind  of  test— call  it 
test  No.  1—  has  sensitivity  characterized  by  the  "time-until-detectability”  threshold  T,.  Let  C| 
>  0  be  the  cost  per  application  (to  an  individual  unit)  of  this  kind  of  test.  For  the  second  kind 
of  test  under  consideration,  test  No.  2,  let  T2  and  c2  be  the  corresponding  parameters. 

Assume  test  No.  1  is  better  in  the  sense  that  T\  <  T2.  To  avoid  triviality,  assume 
c,  >  c2. 

If  the  exact  shape  of  the  function  D(  )  is  known.  Proposition  4  can  be  used  to  decide 
which  test  to  use  for  each  possible  level  of  budget.  Let  b  be  the  budget  per  unit  time  per  indi¬ 
vidual  in  the  population.  Then  the  use  of  test  No.  1  will  permit  a  testing  frequency  of  b/c\  and 
the  use  of  test  No.  2  permits  frequency  b/c2.  To  decide  which  test  to  use,  compare  the 
expected  disutilities  per  unit  time  assuming  a  fully  allocated  budget,  i.e.,  compare  A  (b/t'\.T\) 
and  A  ( b/c2,T2 ).  With  D(  )  known,  these  quantities  can  be  evaluated  explicity  by  Proposition  4 
and  compared. 

It  is  clear  that  the  entire  budget  should  be  allocated  because,  when  D(  )  is  an  increasing 
function, 
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f-  A(r’T) 

dr 


A/A 


[/„ 


'U  D{T+  u)  -  (1  /r)  D(T+  -)  du 
o  r 


<  0. 


When  the  exact  form  of  the  disutility  function  is  not  known,  Proposition  4  does  not 
suffice  to  select  between  tests  No.  1  and  2.  However,  the  two  following  theorems  will  permit 
such  a  determination  at  least  for  certain  relative  configurations  of  budget,  relative  test  sensi¬ 
tivity  T2  —  T\,  and  cost  differential  c2  -  ct  of  the  tests. 

Specifically,  Proposition  5  will  show  that  for  any  (increasing)  disutility  function,  test  No.  1 
is  indicated  if  the  budget  (per  unit  population)  exceeds  ( c]  —  c2)/ ( ^2  —  7j).  On  the  other 
hand.  Proposition  6  shows  that  for  a  convex  increasing  disutility  function,  test  No.  2  is  better  if 
the  budget  is  less  than  (c,  -  c2)/[2(  T2  -  T,)]. 

A  decision  rule  for  the  case  where  the  budget  falls  between  (c,  -  c2)/(T2  -  T{)  and 
(c,  -  c2)/[2(T2  -  7',))  has  not  been  found  for  general  disutility  functions. 

Just  how  the  statement  of  Propositions  5  and  6  are  translated  into  the  above  decision 
rules  is  explained  after  the  statements  of  the  respective  theorems. 

PROPOSITION  5:  Given  £>(■)  a  strictly  increasing  function,  Tt  <  T2  and  c*  >  c2,  then 
T2  -  T\  >  (c  1  -  c2)/rc2  implies 

(4.1)  A  (rc2/c{,T[)  <  A  (r,T2), 

making  test  No.  1  preferable  at  the  per  unit  population  budget  level  of  rc2. 

To  apply  test  No.  I  with  frequency  rc2/c\  versus  test  No.  2  applied  with  frequency  r 
(actions  reflected,  respectively,  in  the  left-  and  right-hand  sides  of  Eq.  (4.1))  would  require  the 
same  budget,  b  =  rc2,  per  unit  population.  The  hypothesis  of  theorem  2  implies  b  =  rc2  > 
(ci  -  c2)/(T2  -  T |).  With  the  hypothesis  in  this  form,  the  theorem  provides  a  lower  bound  on 
the  budget  which  is  a  sufficient  condition  for  test  No.  1  to  entail  lower  expected  disutility  per 
unit  time  vis-a-vis  test  No.  2,  were  the  two  tests  scheduled  at  their  maximal  (subject  to  budget) 
frequencies  rc2/c\  and  r,  respectively. 

The  following  lemma  is  needed  for  the  proof  of  Proposition  6  and  is  recorded  here  for 
general  interest. 

LEMMA  1:  If /is  a  convex  function,  the  0 />)J^  f(s)ds  <  (l/2)[/(r)  +  f(t  +  y)]. 

PROPOSITION  6:  If  £>(•)  is  convex  and  increasing,  7j  <  T2  and  C!  >  c2,  then  T2  -  7j 
<  (c,  -  c2)/2rc2  implies  A  (rc2/c\,T{)  ^  A{r,T2 ),  making  test  No.  2  preferable  at  the  per  unit 
population  budget  level  of  b  -  rc2. 

The  hypothesis  of  this  proposition  implies  b  -  rc2  <  (c|  -  c2)/2(T2  —  7j).  Hence,  Pro¬ 
position  6  indicates  the  superiority  of  test  No.  2  when  the  budget  per  unit  population  is  less 
than  (c |  -  c2)/2(T2  -  7j). 
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APPENDIX 


PROPOSITION  1:  With  q  -  1  -  p. 


C(n,p)  =  Nkr  £  qnW(n)  -  Nkrp  £  n  qn'x  W(n) 

OP  n-0  n-  0 


where 


(n+\)/r 


Win)  -  f  D(s)ds. 

n/r 

Now  let  V0  -  W0\  Vn  -  Wn  -  Wn_x,  n  -  1,  2 .  Note  that  D{  )  non-negative  increasing 

implies  Vj  >  0  for  j  -  0,  1,  2 . Then  W„  -  £  Vj  and 


y-o 

~  C(r.p)  -  N\r  £  Vj  -  N\rp  £  n  q"~x  £  Vj 

OP  n-0  y-0  n-0  y-o 

£ni£ 

y-0  n-y  n-y 

-  -Nkr  £  j  qJ~l  Vj  <  0. 


y-o 


(n  +  l)/f 


—  C(r,p)  -  N\p  £?'’/.  D(s)ds 
or  nZo  "h 


~  Nkp  £  q' 
«— 0 


n  4-  1  ^  |  n  +  1 


-  -  D 
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Nkp  £  q" 
n- o 


(«  +  !)//■ 


f, 


,  D(s)ds  -  - 

n/r  r 


D 


n  +  1 


n  +  1 


<  0. 


This  inequality  follows  from  D(  )  increasing  through  the  relations  D[(n  +  l)/r)  -  D(n/r)  > 
0  and 


(rt  +  l)/r 


f  D(s)ds  -  -  D 
r 


n  +  1 


(rt  +  l)/r 

<  ° 


"  +  1  U  -  ±  D 


n  +  1 


0. 


Q.E.D. 


(n+l)/r  , 


PROOF  OF  PROPOSITION  2:  C(r.p)  =  Nkrp  £  q"  J  £  a, 


„  -  «/r  •  i 

n-0  /—I 


=  Nkrp  t  I  a,  +  1) 

i-l  n-0 

/ 


n  +  1 

m(+ 1 

n_ 

m,+  l 

r 

r 

WAp  £  ajr  '( m ,  +  1) 

/-I  |«-1  n  —  1 

1  -  ”  m+l 


I  </'-1nm'+1  -  I 

NAp  X  oJrm'(m,  +  1)  £  a"’,T'  p  qn~x 


m,  + 1 


-1 


«  —  1 


Q.E.D. 


PROOF  OF  PROPOSITION  3:  C(r,j>)  -  WAp  £  <?*-  ca*/f  (ea/f-  1) 

*-o 

-  («?■>''-  1)  £  (q  ea/l)k. 


k—0 


The  geometric  series  £  ( q  ea/l)k  converges  if,  and  only  if,  r  >  —a/ log  q.  Therefore,  for  r  > 


-a /log  <7, 


k—0 


(«•"-!)  fTT^' 


Q.E.D. 


m 

u - 

r 


PROOF  OF  PROPOSITION  4:  Note  that 

n  ^  “  8(r.«») 

m  —  1 

By  Eq.  (2.1), 

A(r,T)  -  WAr  t  D(u)8[r  oo)  (m) 

n-l  m-l 

-  «  'I  <«) 


1  -  8[r  „„) 


u - 1. 

r  \ 


1  -  8|r.«) 


n-l 


1 

1  -  817-  00) 

u - 

r 

m 

u - 

T 


du. 


du 
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Now  u  <  T  +  (Mr)  u  —  (\/r)  <  T  1  —  8(roo)  [w  —  (1/r)]  =  1.  Therefore, 

A(r.p)  «  NXr  f  D(«)8lr  „>  (a)  8(_o„  r+i/r)(« W« 

p  T+M r 

—  N\r  D(u)du  Q.E.D. 

PROOF  OF  PROPOSITION  5:  By  Proposition  4,  Eq.  (4.1)  is  equivalent  to 
(rc2/c\)  '  D(T\  +  s)  ds  <  r  fo  D(T2  +  s)  ds. 

By  hypothesis,  T2-T\  >  (c t  -  c2)/(rc2)  which  implies  T2  —  T\  >  u(c\-c2)/c2  for 
u  €  [0, 1/r]  or  T2  -  7",  +  u  >  ucjc2  for  u  €  (0, 1/r].  Since  £>(•)  is  increasing,  this  inequality 
implies 

rc->  rcd*-rcJ  r{/r 

-  Jo  D(T|  +  s)  ds  -  rJQ  D(T\  +  ucjc2)  du 

<  r  Jfl' "  D(T\  +  ( 7*2  -  T,  +  u))  du  =  r  JQ'  D(T2  +  u)  du. 

Q.E.D. 

PROOF  OF  LEMMA  1:  Let  h(t  +  s)  =  f(t)  +  (s/y)[f(t  +  .y)  -  /(f)],  s  €  (O.y). 

1  cr+y  1  r*+y 

Since  /  is  convex,  f(t  +  s)  <  h(t  +  s),  s  €  [0,y].  Hence,  —  J  f(s)ds  <  —  J  h(s)ds 

—  —  f  h(t  +  s)ds  =  ^-[/(r  +  y)  +  /(f)]. 
y'  Jo  2 

Q.E.D. 


PROOF  OF  PROPOSITION  6:  By  Proposition  4,  it  suffices  to  show 

(rc2/c\)  r  12  D(r,  +  u)  du  >  r  Jo'  D(r2  +  m)  </w. 

Letting  /(/)  =  0(7",  +  /)  and  T  =  f2  -  Tj,  this  becomes 


(A.l) 


(rc2/c\)  fc'lrC'f(s)ds  >  r  f(T  +  s)ds 


Note  that  D(  )  being  an  arbitrary  convex  increasing  function  implies  /(•)  is  also  convex 
increasing. 

Let  x  =■  cjrc2  and  let  a  =  c2/c\  <  1.  Then  Eq.  (A.l)  becomes 

/.  T +A.t  /•  x 

T  f(s)ds  <  (1/jc)  J0  f(s)ds 

which  will  follow  from 


t  r  I  +O.V 

-  Jr  As^ds  < 

ax 


»  T  A- ax 


l 


J*2  r+a.v  i  /».v 

/(s)c/s  <  —  \  f(s)ds 

0  X  *,0 


(A. 2)  ax  •'r  '  '  2T  +  ax 

Note  that  the  above  expressions  are  all  average  values  of  f(s)  over  their  respective  intervals  of 
integration.  The  right-hand  inequality  follows  from  /(•)  increasing  and  2  7"  + ax  <  x.  To 
prove  the  theorem,  it  only  remains  to  estabilsih  the  left-hand  inequality  in  Eq.  (A. 2). 


(A. 3) 


1 


f 

2  T  +  ax  Jo 

fT  d 
~  Jo  <78 


2T+a.\ 


1  T +ax 

f(s)ds - J  f(s)ds 

ax 

1  r  r+o  .ir+8 


28  +  ax 


d  8 


. . 
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jl_ 

1 

<78 

ax  +  28 

2 

y  (/( r 

n.v  +  28 

/>  r  4-1 1.V  4  A 

r-»  /(s)‘/s 


I 


/’  4  m  ♦  ft 


f(s)ds 


ax  +  28  J  r-t 

which  is  non-negulivc  lor  8  €  [0.7 1  by  Lcnimu  I.  Therefore,  (A. 3)  is  non-negative  anti 

I  fir  f  <n  •  c  ]•+, ly 

2  r  +  ax  -t  ,is)ds  >  Jr  /(l)</v 


7‘4-m 
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Albuquerque,  New  Mexico 

ABSTRACT 

This  paper  extends  the  Low-Lippman  M/M/1  model  to  the  case  of  Gamma 
service  times.  Specifically,  we  have  a  queue  in  which  arrivals  are  Poisson,  ser¬ 
vice  time  is  Gamma-distributed,  and  the  arrival  rate  to  the  system  is  subject  to 
setting  an  admission  fee  p.  The  arrival  rate  A(p)  is  non-increasing  in  p.  We 
prove  that  the  optimal  admission  fee  />*  is  a  non-decreasing  function  of  the  cus¬ 
tomer  work  load  on  the  server.  The  proof  is  for  an  infinite  capacity  queue  and 
holds  for  the  infinite  horizon  continuous  time  Markov  decision  process.  In  the 
special  case  of  exponential  service  time,  we  extend  the  Low-Lippman  model  to 
include  a  state-dependent  service  rate  and  service  cost  structure  (for  finite  or 
infinite  time  horizon  and  queue  capacity).  Relatively  recent  dynamic  program¬ 
ming  techniques  are  employed  throughout  the  paper.  Due  to  the  large  class  of 
functions  represented  by  the  Gamma  family,  the  extension  is  of  interest  and 
utility. 


In  this  paper,  we  consider  an  M/ Ek/\  queuing  system  in  which  the  arrival  rate  to  the  sys¬ 
tem  is  subject  to  control  by  the  service  facility.  Low  (10)  considered  a  similiar  control  model 
for  M/M/C  systems  and  Lippman  [6]  has  approached  the  Low  model  using  new  techniques.  In 
this  paper  we  extend  Lippman’s  treatment  (see  pp  707-708  of  (6))  to  the  case  of  gamma  service 
time  distribution. 

The  model  we  treat  is  the  following.  Customers  arrive  to  a  single  server  infinite  capacity 
queue  according  to  a  Poisson  process  having  rate  A  (/>),  where  p  is  the  decision  variable 
representing  the  price  charged  for  admission  to  the  system.  We  take  p  €  P  and  assume  A  is  a 
nonincreasing  function  of  p.  The  service  time  is  gamma  with  parameters  (k,p)  (defined  such 
that  the  mean  service  time  is  k/p).  We  think  in  terms  of  the  customer  going  through  k  serial 
phases  of  service,  each  having  exponential  service  time  with  rate  p  >  0.  Furthermore,  we 
assume  that  these  phases  are  observable  by  the  decision  maker  (See  "Applications"  below,  and 
the  comment  following  Theorem  2).  We  define  the  state  of  the  system  to  be  the  number  of 
phases  of  service  left  to  complete.  Thus,  if  there  are  c  >  1  customers  in  the  system,  and  the 
customer  being  served  has  /  phases  of  service  remaining,  the  state  of  the  system  would  be 
(c  —  1)A  +  /.  Denote  the  state  by  /  and  the  state  space  by  S  «  {0, 1, 2,  .  .  .}. 


The  cost  structure  has  two  components.  First  the  admission  price  p  is  treated  as  a  cost 
—p.  Secondly,  there  is  a  holding  cost  h O')  defined  as  a  function  of  the  state  variable.  The 
s  optimality  criterion  is  the  minimization  of  expected  discounted  cost.  We  treat  the  infinite  time 
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horizon.  This  defines  our  model  as  a  continuous  time  Markov  decision  process  (see  Ross  [11], 
Bertsekas  [2]). 

Our  aim  is  to  prove  that  the  optimal  admission  price  is  a  nondecreasing  function  of  the 
state  variable.  In  addition,  we  extend  the  results  of  the  Low-Lippman  model  [6]  to  include 
state-dependent  service  rates  and  service  cost  rate  in  the  special  case  of  exponential  service  dis¬ 
tributions  (k  =  1). 

In  concluding  this  summary  we  remark  that  the  techniques  employed  in  this  paper 
represent  an  effort  to  employ  more  general  dynamic  programming  tools  which  have  recently 
appeared  in  the  literature.  Lippman  [6]  notes  the  desirability  of  a  more  general  class  of  tech¬ 
niques  applicable  to  queuing  optimization  problems.  In  this  paper  we  have  formulated  our 
problem  in  the  general  setting  of  Topkis’  framework  for  analyzing  monotone  policies  [13], 
Lippman’s  approximating  techniques  for  finite  and  infinite  horizon  processes  [6,8],  and 
Lippman’s  recent  result  on  unbounded  1-period  cost  functions  [7]. 

APPLICATIONS 

Since  an  arrival  to  the  system  by  a  single  customer  results  in  the  state  variable  increasing 
by  k  units,  we  may  view  the  model  as  applying  to  batch  arrivals  of  jobs  to  a  single  facility  server 
who  services  each  job  in  series  (at  an  exponential  service  rate),  completing  the  entire  batch 
before  starting  work  on  the  next  batch.  This  would  apply  to  batch  arrivals  by  identical  jobs  to  a 
machine  for  processing.  Or,  taking  the  other  view  that  arrivals  to  the  system  represent  single 
jobs,  the  gamma  service  time  model  would  apply  to  situations  where  the  server  must  perform  a 
series  of  k  well-defined  (observable)  processing  operations  on  each  job,  where  each  of  the  k 
sublasks  are  (essentially)  identical  in  service  time  requirements. 

The  entrance  price  introduces  a  revenue  component  to  the  models.  Holding  cost  might 
reflect  costs  of  in-process  inventories  or  higher  machine  (or  service  facility)  maintenance  cost. 

Finally,  the  extension  of  the  Low-Lippman  model  to  accommodate  state-dependent  arrival 
rates  is  applicable  to  two  general  classes  of  situations:  (1)  Where  a  single  service  facility  adjusts 
its  service  rate  in  response  to  increased  work  load  by  speeding  up  or  slowing  down,  and  (2) 
where  such  adjustment  is  made  by  adding  or  relieving  additional  service  facility  support  (for 
example,  when  the  service  facility  is  a  team,  by  adjusting  the  number  of  team  members-at  pos¬ 
sibly  a  higher  service  cost  rate). 

1.  EQUIVALENCE  BETWEEN  CTMDP  AND  DTMDP 

In  this  section  we  summarize  a  new  technique  for  transforming  a  continuous  time  Markov 
decision  process  (CTMDP)  to  an  equivalent  discrete  time  Markov  decision  process  (DTMDP). 
Lippman  appears  to  be  the  first  to  employ  the  method  for  queuing  optimization  problems  [6]. 
See  also  [8],  Serfozo  [12],  Kakumanu  [4],  Winston  [14],  and  Anderson  [1]. 

For  the  following  discussion  refer  to  Ross  [11]  or  Bertsekas  [2],  A  DTMDP  is  defined  by 
specifying  four  objects:  a  state  space  S ,  a  collection  of  action  spaces  {A,:i  €  S),  transition  proba¬ 
bilities  {Pij(a)\i,  j  €  s,  a  €  A,),  and  cost  functions  {C(/,a);/'  €  S,  a  €  A,}.  For  a  CTMDP,  in 
addition  to  these  four  objects  we  must  specify  transition  time  probability  distributions 
\Fij(a):i,  j  €  S,  a  €  A,}  where  F,j  (a)  is  an  exponential  distribution  having  parameter  A  (/.a). 
Also  costs  C(i,a)  are  realized  per  unit  time.  For  either  process  we  specify  a  discount  factor 
a,0  <  a  <  1  and  define  V  to  be  the  optimal  return  function.  F(/)  equals  the  minimum 
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expected  discounted  cost  realized  over  an  infinite  time  horizon  starting  in  state  i  6  S.  We  may 
then  define  the  following  functional  equations  of  dynamic  programming. 

For  the  DTMDP,  0  <  n  <  +  and  i  €  S, 

(l)  M<)  =  o 


(2) 


W') 


(We  define  =  V.) 


oo 


min{C(/,a)  +  £  P^a)  Vn (/')}/ (A  +a) 

aiAi  7-0 


For  the  CTMDP, 


(3) 

where 

and 


VU) 


=  min{C(/,a)  +  q(i,a)Y^  Pu(a)V(j)} 

q(i,a)  =  X  (i.a )/  (a  +  X(/,a)) 

C(i,a )  =  C(/.a)/(a  +  X(/,a)). 


A  policy  is  a  rule  for  choosing  actions,  i.e.,  is  a  function  8:T  x  S  —< >  A  where  Tis  the  time  axis 
of  the  process  (T  =  {0, 1,2,  .  .  .)  for  DTMDP  and  T  =  [O.L],  L  ^  +  °°  for  the  CTMDP),  and 
8 (/,/)  €  A i  for  each  i  6  S.  Define  K8  to  be  the  return  function  when  using  policy  8.  An 
optimal  policy  8  *  is  one  satisfying  *  =  V.  8  *  is  stationary  if  it  is  constant  on  the  factor  T. 
Conditions  which  insure  the  existence  of  an  optimal  stationary  policy  are  specified  by  the  well- 
known  contraction  and  monotonicity  properties  of  Denardo  [3].  When  an  optimal  stationary 
policy  exists,  V  then  satisfies  equation  (2)  (in  the  case  of  the  DTMDP)  or  (3)  (in  the  case  of 
the  CTMDP). 


Given  a  CTMDP  for  which  an  optimal  stationary  policy  exists,  assume  that 
A  =  sup  {\(i,a):i  €S,  a€/4}<+°°. 

Define 

(4)  | (A  —  \(i,a))/\  i  =  j 

V  -  |x(/,a)/^  (a)/A  /  ^  j 

It  can  then  be  shown  (see  [6,8],  [4],  [12],  [5,  p.  121],  or  [1])  that  the  CTMDP  is  computation¬ 
ally  equivalent  to  a  DTMDP  having  the  same  state  and  action  spaces,  cost  function 
C(/,fl)/(A  +  a),  transition  probabilities  Py'(a),  and  discount  factor  A/(A -I- a).  Thus,  the 
optimizing  actions  for  the  CTMDP  can  be  computed  from  the  following  functional  equations, 
for  /  €  S, 

(5)  F(/)  =  min{C(/,a)/(A  +  a)  +  (A/ (A  +  «))£  Py'ia)  K0')1 

a£Af 

(In  (3),  \  (i,a)Pjj(a)  is  typically  referred  to  as  the  rate  of  the  transition  i  — *  j).  For  a  treatment 
of  a  CTMDP  having  lump  sum  costs  see  [1]  or  [12], 

2.  MONOTONE  OPTIMAL  POLICIES 

In  a  somewhat  technical  paper,  Topkis  [13]  presents  a  general  framework  for  analyzing 
monotone  optimal  policies  in  optimization  problems.  For  our  purposes  here,  we  specialize  his 
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results  to  the  problem  we  will  be  treating.  Let  J  be  a  function  defined  on  A  x  B,  where  A  and 
B  are  lattices.  A  function  J  is  said  to  be  submodular  on  A  x  B  if  for  a |(  a2  €  A,  b\,  b2, 

€  B,  flj  >  a |,  b2  >  b\,  Jiau  b |)  +  Jia2>  b2)  <  Jia b2)  +  Jia2,  b |).  Suppose  we  have 
the  following  optimization  problem,  /(<?)=  min  J  ia,  b).  Let  b*ia)  be  the  infimum  of  the  set 

btB 

of  points  in  B  which  minimizes  Jia,  b).  Then  under  certain  conditions  on  A,  B.  and  J.  Topkis 
shows  that  to  establish  that  b*  is  non-decreasing  in  a,  it  is  sufficient  that  J  be  submodular  on 
A  x  B.  (See  Theorem  6.2  of  [13]  for  details.) 

3.  AN  M/Ek/l  QUEUE  WITH  CONTROLLED  ARRIVAL  RATE: 

THE  INFINITE  TIME  HORIZON  PROBLEM 

We  make  the  following  assumptions: 

(1)  p  €  P,  where  Pis  a  compact  subspace  of  (0,  +  °°], 

(2)  A  is  a  nonincreasing  continuous  function  of  p  €  P.  Also  for  p  €  P,  0  <  A  ip)  <  + 

oo. 

(3)  /;  is  nondecreasing  in  /  €  S  =  [0,1,2  .  .  .]  and  satisfies  //(/  +  k)-h(i  +  k  -  1) 
>  //(/)  -/;(/  -  1),  /  >  1. 

Notation:  Let  /be  a  real-valued  function  on  [0, 1,2,  . . .).  Define  A/0)  =  /(/)-/(/  -  1).  /  ^ 

1.  Define  A  =  sup  A (p)  +  u. 

riP 

From  Section  1,  the  equivalent  discrete  time  recursions  for  our  CTMDP  are  easily  seen  to  be 
the  following: 

For  /  €  S, 

(6)  y0o)~  o 

(7)  V„  (/)  =  min  J (n,i,p)/ (A  +a),  n  >  0 

p 

where  J  is  defined  as  follows: 

For  i  =  0,  and  0  ^  n  <  4-  °° , 

(8)  J(,n,0,p)  -  -p\(p)  +  \(p)  V„-\{k)  +  Fn_!(0)(A  -  A (/?)), 
and  for  /  >  0, 

J(n,i,p)  =  h(i)  -  p\(p)  +  kip)  +  k)  +  p.  -  1) 


(9)  +  Ffl_,(/)(A  -  kip)  -m) 

Notation:  In  (7)  define  the  infimum  of  the  minimizing  actions  by  p*  ii.n)  for  n  <  +  oo  and 
p*ii)  when  «  =  +  <».  Also  write  =  V  and  when  n  =  +  °°  we  suppress  n  in  the  list  of 
arguments  for  J,  i.e.,  we  will  write  Jii.p). 

THEOREM  1 .  There  exists  an  optimal  stationary  policy  to  the  CTMDP  defined  above. 

PROOF.  The  primary  consideration  here  is  the  presence  of  the  unbounded  1 -period  cost 
function  h.  Although  Denardo’s  results  [3]  do  not  directly  apply,  Lippman  [7]  has  established 
conditions  under  which  Denardo’s  contraction  and  monotonicity  results  do  apply.  In  our  case  it 
is  trivial  to  verify  that  Assumptions  1,2,  and  3  of  [7,  p.  1227]  are  satisfied,  and  the  theorem  is 
established.  ■ 
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We  now  establish  two  supporting  lemmas. 

LEMMA  1.  For  0  <  n  <  +  <»  and  i  >  1,  A  V„U)  >  0. 

PROOF.  We  prove  the  lemma  for  finite  n  (by  finite  induction);  the  case  for  infinite  n  then 
follows  by  taking  limits.  From  (6)  the  result  holds  trivially  for  n  =  0.  First  assume  /  >  1  and 
define  p  -  p*(n,i).  Then  from  (7)  and  (9), 

F„0)  -  K,(i  ~  D  >  J(n.i,p)  -  J(n,i  -  ,p ) 

»  A hU)  +  AF„_,(/  +  k)\(p)  +  A  F(I_1(/)(A  -  X(p)  -  m>  +  A  V„-\(i  -  l)/a 

^  0,  where  the  inequality  follows  from  Assumption  (3),  the  definition  of  A,  and  the  inductive 
hypothesis  AF'n_1^0.  When  i  =  1,  the  same  computation  holds  except  that  the  term 
A  -  1)m  is  not  present.  ■ 

The  next  result  establishes  a  condition  on  Vn  which  is  somewhat  weaker  than  the  convex¬ 
ity  required  in  Lippman’s  treatment  (6,  p.  708,  Theorem  15). 

LEMMA  2.  For  0  <  n  <  +  °°  and  /  ^  1,  A  Vn(i  -f  k)  ^  A  V„(i) 

PROOF.  For  n  =  0,  the  result  holds  trivially.  Assume  it  true  for  n  —  1  <  +  °°.  Define 

Pi  =  p*(n,/'-D  and  p2  =  p*(n,i  +  k).  Then  for  i>  1,  from  (7),  (8)  and  (9), 

A  V„(i  +  k)  —  A  V„(i)  ^  [J(n,i  +  k,p2)  -  J(n,i  +  k  -  l,p2)l  -  [J(n,i,p\)  -  J(n,i  -  l.pi)] 

=  A h(i  +  k)  -  A/i (/') 

+  A  y„-\(i  +  2 k)\(p2) 

+  A  y„-\(i  +  k)(A  -  m  -  X(p2)  -  X(pi)) 

+  A  y„-\(i  +  k  -  Dm 
+  A  y„-\(i)(— A  +  m  +  k(p i)) 

^  A  y„-i(i  +  k  -  1)m  ^  0,  where  the  next  to  last  inequality  follows  from  Assumption  (3),  the 
inductive  hypothesis  and  the  definition  of  A,  and  the  last  inequality  follows  from  Lemma  1. 

For  /  =  l,a  similar  argument  combined  with  the  result  for  /  >  1  yields  the  desired  result.  For 

«  =  +  <»,  the  result  obtains  by  taking  the  limit  as  n  — *  +°°  on  the  finite  case.  ■ 

We  now  establish  our  first  main  result, 

THEOREM  2.  The  optimal  entrance  price  is  a  nondecreasing  function  of  /  €  S,  i.e., 
pHi  +  1)  >  p*(i). 

PROOF.  From  Section  2,  it  suffices  to  show  that  J  is  submodular  on  S  x  P.  Let  p\  $  p2. 
Define  A,p  by 

-  JU,P\)  +  J(i  +  L P2)  ~  JU,p2)  -  J(i  +  \,P\). 

Then  from  (9),  for  /  >  0. 

A/p-  (A  y(i  +  1)-A  V(i  +  k  +  1  ))(X(p,)-X(p2)) 

<  0,  where  the  inequality  follows  from  Assumption  (D  and  Lemma  2.  when  /  -  0  we  must 
use  (8)  and  (9)  and  the  same  computation  results.  To  establish  the  general  case 
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J(i.P i)  +  JU  +  j.Pi )  _  Jd.Pi)  ~  J (/  +  7,P;)  <  0.  Note  that  this  expression  can  be  written  as 

^  A,+/p  each  term  of  which  is  nonpositive  by  the  case  above.  ■ 

/- o 

Thus,  the  greater  the  "load"  on  the  server  (as  measured  by  /  €  S),  the  greater  is  the  price 
charged  for  admission  to  the  system  and  thus,  the  lower  the  arrival  rate.  From  the  definition  of 
the  state  variable  we  conclude  that  the  optimal  entrance  fee  p*  is  nondecreasing  in  both  the 
number  of  customers  in  the  system  and  the  number  of  phases  of  service  remaining  on  the  cus¬ 
tomer  currently  being  served.  Along  these  lines,  as  a  special  case,  we  can  take  li  to  be  a  func¬ 
tion  only  of  the  number  of  customers  in  the  system  by  defining  //  to  be  constant  on  each  seg¬ 
ment.  [(H).  U,  2k),  ...  and  requiring  that  the  restriction  of  It  to  the  set  {0,k,2k,  . . .)  be  con¬ 
vex.  It  is  easily  verified  that  all  proofs  go  through. 

It  would  be  of  interest  to  extend  Theorem  2  to  the  truly  finite  horizon  CTMDP.  However 
we  have  thus  far  been  unable  to  do  this.  Using  results  of  Lippman  ([8]— see  the  connectedness 
condition  on  page  483  and  also  Theorem  4  of  that  paper)  the  monotonicity  of  p*  for  the  finite 
time  horizon  would  follow  if  we  could  prove  that  J  is  submodular  in  n  and  p,  and  this  in  turn 
would  follow  from  A  Vn(i)  ^  A  F„_,(/).  We  have  been  unable  to  establish  this  last  inequality 
(or  its  reverse  which  would  also  be  sufficient). 

4.  THE  LOW-LIPPMAN  MODEL  WITH  STATE-DEPENDENT  SERVICE  RATES 

Once  again  we  consider  Lippman’s  model  [6,  p.  707],  this  time  incorporating  a  state- 
dependent  service  rate  and  a  state-dependent  service  cost.  Specifically  we  have  an  M/M/l 
queue  with  queue  capacity  Q ,  where  Q  <  +  °°.  An  entrance  price  p  is  charged  and  the  arrival 
rate  k  is  a  function  of  p.  The  state  of  the  system  is  the  number  of  customers  present  and  is 

denoted  by  i  €  S  =  {0,  1 . Q ).  Given  the  state  is  /,  a  holding  cost  //(/)  per  unit  time  is 

incurred.  Furthermore,  we  shall  assume  that  the  server  works  at  rate  /*,  and  incurs  a  service 
cost  C(Ml)  per  unit  time  when  the  state  is  /'. 

The  functional  equations  corresponding  to  (6),  (7),  (8),  and  (9)  are,  for  Q  =  4-  °°, 

yo0)  =  0 

V„ (/)  =  min  J(n.i,p)/( A  +  a),  n  >  0 

For  /  =  0,  0  <  n  <  +  °° 

J(n,0,p)  =  -p\(p)  +  k(p)  V„_ | ( 1 ) 

+  F„_,(0)(A  -  A (/?)) 

and  for  /  >  0, 

J(n.i.p)  =  /;(/)  +  C(p.,)  -  pk(p)  +  X(a»)  F„_|(/  +  1) 

+  ft,  V„-\(i  ~  1)  +  F„_|(/)(A  -  fi,  -  k(p)) 

when  Q  <  +  and  i  =  Q,  the  transition  /'  —  /'  +  1  has  rate  zero.  In  addition  to  Assumptions 
(1)  and  (2)  of  Section  3  above,  we  make  the  following: 

(2b)  When  Q  <  +  °o  we  make  the  convention  that  p*(Q)  =  sup  \p  €  P)  and 
k(p'(Q))  =0. 
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(3)  h  is  nondecreasing  and  convex  in  i  €  5  =  {0, 1,2 . 0} 

(4)  C(p.,)  is  convex  increasing  in  i  €  S  *  (0, 1,2 . 0).  (As  a  special  case  we  may 

take  =  0). 

(5)  n,  is  concave  in  /  and  satisfies  2fi,  -  fi2  ^  0-  When  Q  <  +  °o  we  require  addi¬ 
tionally  that  2p  q  -  hq  >  sup  {k  (/>)}.  When  Q  =  +  °°,  in  order  that  A  be  finite 
we  assume  that  is  non-decreasing  in 

When  ft,  is  taken  to  be  non-decreasing  in  /,  this  may  be  interpreted  in  terms  of 
the  server  choosing  to  increase  the  service  rate  (at  higher  costs  in  case 
>  0).  The  case  where  n,  is  non-increasing  in  i  would  represent  a  system 
where  the  effectiveness  of  the  service  facility  decreases  as  the  work  load  increases. 

We  may  now  prove  the  analogs  of  all  the  previous  results.  The  proofs  obtain  by  employing  the 
same  techniques  as  before.  We  remark  that  A  is  defined  to  be  the  sup  {A  (p )  +  <  4-  °°  (by 

i.P 

Assumptions  (2)  and  (5)).  Lemma  1  is  straightforward.  Lemma  2  is  restated. 
A  y„(i)  >  A  V„(i  -  1)  j.e.,  V„  is  convex  in  /.  To  establish  this  requires  Assumptions  (4)  and 
(5).  In  Theorem  2,  we  establish  submodularity  as  before;  when  Q  <  +  °o  we  observe 
Assumption  (2b).  Furthermore,  for  this  model  we  may  prove  that  p*  is  non-decreasing  in  /  for 
the  finite  horizon  continuous  time  problem  as  follows.  We  first  show  that  p*  is  non-increasing 
as  a  function  of  n  by  establishing  that  J  is  submodular  in  n  and  p.  This  last  condition  will  fol¬ 
low  from  AK„(/)  >  Af„.|(i)  which  is  easily  proved.  We  then  invoke  Lippman's  connected¬ 
ness  condition  to  establish  Theorem  4  of  (8)  (see  page  483).  In  every  instance  we  observe  the 
various  special  cases  (/  =  0,  /=(?<+  °o).  We  summarize  these  remarks  in  the  following 
Theorem. 

THEOREM  3.  In  the  Low-Lippman  Model  [6,  p.  707]  with  state-dependent  service  rates 
and  service  costs,  under  Assumptions  (1)  through  (5),  and  for  finite  or  infinite  queue  capacity, 
in  both  the  finite  and  infinite  time  horizon  CTMDP,  the  optimal  entrance  price  is  a  non¬ 
decreasing  function  of  the  number  of  customers  in  the  system. 

We  note  that  the  special  case  C  =  0  and  h  linear  was  treated  by  Lippman  and  Stidham 
[9]. 

5.  TOPICS  FOR  FURTHER  RESEARCH 

We  have  thus  far  been  unable  to  entend  Theorem  2  to  either  the  finite  queue  capacity 
case  or  to  the  state-dependent  service  rate  model.  A  few  numerical  computations  have  been 
made  but  no  counter-examples  yet  found.  This  might  be  a  research  topic  of  interest  to  other 
authors  working  in  the  area. 

In  general  the  technique  of  transforming  a  CTMDP  to  a  DTMDP  appears  to  be  a  very 
effective  tool  for  analyzing  birth-death  processes.  However,  when  more  general  state  transi¬ 
tions  are  allowed,  complications  seem  to  appear.  The  author  in  [1]  has  had  some  success  in 
this  respect  in  treating  various  machine  maintenance  models.  Another  difficult  class  of  control 
problems  (using  the  transformation  technique  CTMDP  —  DTMDP)  are  Markov  decision 
processes  involving  two  dimensional  state  spaces  (such  as  one  would  have  in  treating  tandem 
queues,  or  the  operation  of  two  machines,  for  example).  This  might  be  a  rich  field  of  research 
for  authors  interested  in  applying  the  new  methods  employed  in  this  paper  for  analyzing  Mar¬ 
kov  processes  with  structure. 
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ABSTRACT 

The  dynamic  transportation  problem  is  a  transportation  problem  over  time. 

That  is,  a  problem  of  selecting  at  each  instant  of  time  /,  the  optimal  flow  of 
commodities  from  various  sources  to  various  sinks  in  a  given  network  so  as  to 
minimize  the  total  cost  of  transportation  subject  to  some  supply  and  demand 
constraints.  While  the  earliest  formulation  of  the  problem  dates  back  to  1958 
as  a  problem  of  flnding  the  maximal  flow  through  a  dynamic  network  in  a 
given  time,  the  problem  has  received  wider  attention  only  in  the  last  ten  years. 

During  these  years,  the  problem  has  been  tackled  by  network  techniques,  linear 
programming,  dynamic  programming,  combinational  methods,  nonlinear  pro¬ 
gramming  and  finally,  the  optimal  control  theory.  This  paper  is  an  up-to-date 
survey  of  the  various  analyses  of  the  problem  along  with  a  critical  discussion, 
comparison,  and  extensions  of  various  formulations  and  techniques  used.  The 
survey  concludes  with  a  number  of  important  suggestions  for  future  work. 


1.  INTRODUCTION 

The  classical  transportation  problem  refers  to  the  shipment  of  goods  from  a  set  of  sources 
to  a  collection  of  sinks  at  a  minimum  cost.  Analysis  of  these  problems  dates  back  to  the  works 
of  Kantorovich  [28],  Hitchcock  [26],  Koopmans  [31],  and  Dantzig  [9],  Subsequently,  much 
work  has  been  done.  (See  e  g.,  Holladay  [27],  Potts  and  Oliver  [35],  Bradley  [5],  Christofides 
[7],  and  Kennington  [29].) 

The  classical  problem  is  static  in  nature  in  the  sense  that  shipments  are  instantaneous,  and 
costs  as  well  as  supply  and  demand  requirements  are  independent  of  time.  A  notational  formu¬ 
lation  of  the  static  transportation  problem  appears  in  Section  2. 

That  the  dynamics  of  the  transportation  problem  are  important  was  first  recognized  by 
Ford  [15]  in  his  formulation  of  the  maximal  dynamic  flow  problem.  (See  also  Gale  [19]  and 
Ford  and  Fulkerson  [16].)  Since  then,  a  number  of  other  dynamic  extensions  have  been  stu¬ 
died.  These  include  the  minimum-time  transportation  problem  (Szwarc  [44,45,46],  Hammer 
[23],  Tapiero  and  Soliman  [51],  Tapiero  [49],  and  Srinivasan  and  Thompson  [41]);  the 
minimum-cost  transportation  problem  (Bellmore,  Eklof,  and  Nemhauser  [2],  Szwarc  [47],  Tapi¬ 
ero  [48],  Tapiero  and  Soliman  [51],  and  Srinivasan  and  Thompson  [41,42]);  and  the  maximal 
dynamic  flow  problem,  both  for  a  single  commodity  (Ford  [15],  Ford  and  Fulkerson  [16])  and 
in  the  multi-commodity  case  (Bellmore  and  Vemuganti  [3]). 
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It  is  the  purpose  of  the  present  paper  to  survey  the  above  work  as  well  as  some  applica¬ 
tions  of  the  dynamic  transportation  problem.  A  schematic  representation  of  the  various  treat¬ 
ments  of  this  problem  is  given  in  Figure  1. 


Figure  I 


The  organization  of  our  survey  is  as  follows.  Section  2  establishes  the  basic  notation  and 
points  out  some  of  the  possible  ways  in  which  the  static  problem  can  be  generalized  to  the 
time-dependent  case.  Sections  3,  4,  and  5  deal  with  the  case  of  discrete  time,  and  continuous 
time  is  treated  in  Section  6. 

Section  3  is  concerned  with  the  deterministic  dynamic  transportation  problems,  i.e.,  the 
minimum-time,  minimum-cost,  and  the  maximal  dynamic  flow  problems.  We  consider  in  Sec¬ 
tion  4  a  stochastic  multiperiod,  multimode  model  (Midler  [34]),  a  problem  of  discrete-time  sto¬ 
chastic  optimal  control  which  is  solved  via  a  dynamic  programming  algorithm. 

Section  5  deals  with  two  illustrative  applications  of  the  discrete  time  dynamic  transporta¬ 
tion  problem.  One  involves  a  multiperiod  truck  delivery  problem  (Hausman  and  Gilmour 
[25]),  and  the  other  studies  the  optimal  rescheduling  of  air  traffic  in  response  to  stochastic 
influences  (Ellis  and  Rishel  [II]). 

Section  6  is  concerned  with  the  dynamic  transportation  problem  over  continuous  time, 
and  moreover,  when  explicit  account  is  taken  of  the  time  delay  between  dispatch  of  a  shipment 
at  the  sink  and  its  receipt  at  the  source.  The  appropriate  framework  is  that  of  optimal  control 
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theory,  and  papers  surveyed  include  those  of  Frank  [17),  Frank  and  El-Bardai  (18),  Tapiero  and 
Soliman  [51 1,  and  Tapiero  [49], 


Section  7  concludes  the  paper  with  extensive  outlines  and  suggestions  for  future  research. 
Portions  of  this  work  are  already  under  way  (Sethi  and  Bookbinder  [381). 


2.  STATIC  TRANSPORTATION  PROBLEM 


By  the  static  transportation  problem,  we  shall  mean  the  well-known  linear  programming 
problem 


subject  to 


Min  7  =  £  £  q.jU.j, 

/-I  7-1 


II 

£  u „  =  A ,  (Supply  constraints) 

/-i 

in 

£  u,,  =  B,  (Demand  constraints) 
i-i 

0  <  u i,  ^  c,i  (Capacity  restrictions). 


In  this  notation  (which  has  been  chosen  to  conform  with  that  of  the  control-theoretic  treatment 
of  transportation  problems),  u0  is  the  shipment  from  origin  /  to  destination  j,  at  a  cost  of  qir 
Each  source  node  /  has  a  supply  Ah  the  sink  nodes  j  require  an  amount  Bn  and  so  a  transporta¬ 
tion  schedule  is  to  be  found  that  minimizes  the  total  shipping  costs  while  not  exceeding  the 
capacity  c„  of  any  arc  (ij).  It  should  be  noted  that  the  above  problem  refers  to  the  shipment 
of  a  single  commodity  (or  at  least  only  a  standard  "product  mix")  and  only  a  single  mode  of  tran¬ 
sportation  is  considered  for  each  link  (ij). 


There  are  a  number  of  possible  ways  in  which  this  problem  can  be  made  time-dependent: 

2.1.  A,  and/or  B ,  Functions  of  Time 

If  the  supply  capabilities  or  the  demand  requirements  are  time-dependent,  then  the  production 
schedule  can  be  smoothed  by  incurring  the  costs  of  holding  inventory  which  was  delivered 
early.  Inventory  variables  could  be  allowed  at  sources  as  well  as  sinks,  and  there  may  be 
included  upperbounds,  possibly  time-dependent,  on  those  inventories. 

2.2.  q „  Functions  of  Time 

The  remarks  of  the  preceding  paragraph  are  applicable  here  as  well. 

2.3.  Associate  a  Time  t„  with  Each  utj 

The  notation  r,y  refers  to  the  time  required  to  ship  the  u,j  units  over  the  route  (/J),  so  that  the 
relevant  problem  is  the  minimization  of  the  time  needed  to  satisfy  all  the  demand  require¬ 
ments.  It  should  be  noted  that  the  costs  qtJ  would  not  normally  enter  into  this  problem 
(though  they  could  of  course  be  taken  into  account  via  subsidiary  constraints;  see  Glickman 
and  Berger  [21],  and  also  Srinivasan  and  Thompson  [42]). 


68 


J.H.  BOOKBINDER  AND  S  P  SETHI 


A  convenient  division  of  the  papers  in  this  survey  is  according  to  whether  the  analytical 
techniques  involve  discrete  time  or  continuous  time.  We  shall  begin  with  the  former,  and  in  fact 
with  a  problem  of  type  3  above. 

3.  DETERMINISTIC  DYNAMIC  TRANSPORTATION  PROBLEM:  DISCRETE  TIME 
3.1.  The  Minimum-Time  Transportation  Problem 

One  is  interested  in  minimization  of  the  total  time  required  to  transport  all  the  goods 
from  the  origins  to  their  destinations,  i.e., 

Min  t*  =  (maxfr,,)!//,,  >  0’  . 

Possible  applications  include  the  transport  of  perishable  goods;  the  movements  of  military  units 
from  bases  to  fronts;  or  the  shipment  of  customers’  orders  which  are  of  a  priority  or  "rush" 
nature. 

The  minimum-time  transportation  problem  has  been  considered  by  Szwarc  (44,45,46]  and 
Hammer  (23,241.  (See  also  Garfinkel  and  Rao  (20]  and  Srinivasan  and  Thompson  (41].) 
Szwarc  [451  modified  his  original  (44]  algorithm  to  prevent  it  from  cycling.  Szwarc  (461  in  a 
similar  way  refined  Hammer's  [231  algorithm  and  revised  a  proof  concerning  the  equivalence  of 
local  and  global  optima.  Hammer  [241  also  furnished  some  amendment  of  his  own  work. 

Szwarc  [46]  has  shown  that  his  algorithm  [451  is  essentially  equivalent  to  that  of  Hammer 
[23],  By  this,  we  mean  they  both  produce  the  same  sequence  of  basic  feasible  solutions,  given 
that  they  begin  with  the  same  initial  solution.  The  Szwarc-Hantmer  algorithm  is  a  primal  algo¬ 
rithm,  but  Garfinkel  and  Rao  [201  employ  a  "threshold"  algorithm  that  yields  a  primal-feasible 
solution  only  upon  termination.  Srinivasan  and  Thompson  [411  have  studied  the  Bottleneck 
Time  and  Bottleneck  Shipment  transportation  problems,  and  have  furnished  an  algorithm  for 
the  former  (i.e.,  for  the  minimum-time  transportation  problem)  that  is  virtually  identical  to  the 
Szwarc-Hammer  algorithm.  Srinivasan  and  Thompson  [41]  have  also  shown  that  their  algo¬ 
rithm,  and  hence  the  Szwarc-Hammer  algorithm,  is  computationally  more  efficient  than  that  of 
Garfinkel  and  Rao  [20], 

The  main  points  of  the  Szwarc-Hammer  algorithm  involve  the  following  steps; 

1.  Find  an  initial  basic  feasible  solution  (BFS)  to  the  problem.  This  is,  of  course,  a  basis 
of  order  (m  +  n  -  1).  Let  /  denote  the  current  value  of  the  bottleneck  time. 

2.  Find  an  adjacent  (i.e.,  a  change  of  only  1  basis  element)  BFS  which  is  better  (i.e., 
either  t  is  lower,  or  if  t  is  unchanged,  there  is  a  smaller  quantity  shipped  on  the  bottleneck  arc). 
Hammer  gives  a  4-step  procedure  by  which  this  can  be  done. 

(i)  Among  these  basic  arcs  (i.j)  for  which  r,;  =  t,  identify  the  maximum  quantity 
shipped.  Call  the  quantity  uhk . 

(ii)  Determine  the  set  Shk  of  all  non-basic  elements  (p.t/),  such  that  if  uw  were  intro¬ 
duced  into  the  basis,  the  shipment  uhk  would  thereby  be  reduced. 

(iii)  Choose  among  the  elements  of  Shk  the  one,  say  (/>0,<70),  for  which  rw  is  the 
smallest. 
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(iv)  Enter  up^t)  into  the  basis,  as  in  the  usual  transportation  problem.  Update  /. 
(Szwarc,  at  this  stage,  eliminates  from  further  consideration  any  non-basic  uu  for 
which  r„  >  /.) 

3.  Check  for  optimality,  i.e.,  for  the  existence  of  a  still  better  BFS.  Szwarc’s  procedure 
follows  from  his  Theorem  2,  which  as  above,  indicates  optimality  when  the  only  vacant  "cells" 
u,j  available  for  entry  into  the  basis  turn  out  to  have  t(>  ^  r. 

4.  Return  to  step  2  or  terminate  at  optimal  solution. 

We  remark  that  the  preceding  treatments  have  emphasized  the  time  objectives  with  little 
mention  of  the  concomitant  costs.  Hammer  [23)  did  give  an  extension  of  his  algorithm  to 
determine,  from  among  the  alternative  optimal  solutions  with  the  given  /*,  that  solution  with 
minimum  cost.  An  interesting  extension  would  be  the  imposition  of  an  additional  constraint 
involving  the  upper  bound  of  total  cost.  Alternatively,  Glickman  and  Berger  [21]  have 
analyzed  the  trade-off  between  the  cost  and  time  of  transport,  by  solving  the  usual  minimum 
cost  problem,  subject  to  an  upper  bound  on  the  time  of  transportation.  Srinivasan  and  Thomp¬ 
son  [42]  have  determined  cost/time  efficient  frontiers  for  the  multimodal  problem.  For  every 
Pareto-optimal  point,  their  algorithm  furnishes  the  routes,  modes,  and  quantities  shipped. 

3.2.  Multiperiod  Transportation  Problem 

Attention  is  now  addressed  to  the  problem  of  minimization  of  total  costs  when  the  n  sinks 
have  demands  for  each  of  T  periods.  This  problem  has  been  studied  by  Bellmore,  Eklof,  and 
Nemhauser  [2]  (B.E.N.),  and  by  Sz ware  [47],  (The  latter  reference  first  appeared  in  1967  as  a 
CORE.  Discussion  Paper,  No.  6704.) 

B.E.N.  treat  the  general  case  in  which  An  Bh  and  are  all  functions  of  time,  which  is 
denoted  by  the  presence  of  a  superscript  t.  Shipments  to  the  sinks  from  the  m  sources  can 
either  take  place  in  the  same  period  as  the  demand  requirement,  corresponding  to  a  shipping 
cost  of 

Q'j  K' 

or  earlier ,  in  which  case  a  linear  cost  for  holding  inventory  is  then  also  incurred.  If  the  notation 
yj  denotes  the  inventory  carried  from  period  (r  —  1 )  to  period  t  at  sink  j,  and  dj  the 
corresponding  inventory  holding  cost,  the  problem  of  B.E.N.  can  then  be  formulated  as: 

Min  y  =  £  x  q'j  ub  +  £  L  d'i  yj 

i.j  l- 1  j- 1  1-2 

subject  to: 

£  u!j  <  A!,  v  /,  / 

j- 1 
m 

I  q'j  +  y]  -  yj*'  >  B).  vy.  t 

i- 1 

tiij  >  o 

0  <  y'j  <  /V/, 

where  N)  is  the  maximum  inventory  which  can  be  stored  at  sink  j  between  periods  (f  —  1)  and 
t. 
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Wore  n  not  lot  i ho  inclusion  of  iho  upiH'rhounds  V,\  the  above  problem  could  bo  solved 
hv  creation  ol  a  largo  noi.o/o  /hv/ik/  prohlom.  with  t m/>  sources  anil  ( nl )  sinks  (bowman  |4| 
aiul  Kroihol  I'.'ll  I  Ins  approach,  as  H  I  N  point  out.  yields  computet  storage  icquircmcnls 
that  are  ptopoiiion.il  to  / 

H  I  N  approach  the  problem  through  a  muwniiiH  cost  How  01  iittnsshii»nrni  model,  i  e  . 
Ibn  source,  n  sink*  pnrhloms  linked  by  inventory  variables  ( Iho  "nodes"  are  the  I' and 
H  lor  all  values  ol  i  I  I  he  compuiei  stoiage  rci|iurcmcnts  arc  then  proportional  to  /  III  N 
also  point  out  a  ./<  i  ow/H'M/io/i  ol  the  transshipment  problem  that  requires  rapid  access  compuiei 
storage  ol  data  lot  only  n  more  nodes  and  n  more  arcs  than  does  the  standard  transput  tation 
problem  I  hat  is.  the  stoiage  rouuircmcnts  for  the  decomposition  are  then  independent  ol  / 
(and,  m  lad,  arc  at  anytime  comprised  of  the  data  for  a  single  period  in  core) 

I  hc  K  I  N  transshipment  algorithm  is  olfcctively  composed  of  the  following  steps 

I  \dd  an  artificial  source  node  s’.  and  an  artificial  sink  note  r* 

Introduce  arc  "distances",  which  are  essentially  «//<  01  ./ .  suitably  modified  b\  the 
existing  How  m  the  network  (Initial  Mow  —  0  ) 

)  find  a  shortest  path  ft  out  s’  to  /*.  and  ship  the  maximum  amount  possible,  con 
sistem  with  the  upperbounds  on  inventories 

•I  Increase  Hows,  revise  "distances",  and  re  calculate  shortest  path 

>  Mgonlhm  icimmales  when  all  demands  have  been  satisfied 

I  he  papei  of  S/warc  I -I  ’I  is  addressed  to  the  same  problem  as  that  of  H  I  N  .  but  ddVeis 
in  that  S/warc  does  not  include  an  inventory  holding  cost  It  I  N  include  this  cost,  as  well  as 
upperbounds  on  inventories  at  each  sink  node  Neither  S/warc  nor  14 1  N  include  uppei 
bounds  on  the  shipments  u  .  although  as  It  I  N  point  out.  the  modification  to  the  transship 
ment  algorithm  in  this  case  is  clem 

It  may  be  argued  that  the  It  I  N  ’s  desire  to  reduce  the  core  requirements  was  motivated 
by  desires  more  academic  than  practical,  in  that  few  real  problems  would  assume  as  icgulat  oi 
straightforward  a  form  However,  the  work  of  It  I  N  should  be  quite  practical  for  a  more  com 
plicated  problem  with  additional  constraints,  since  the  It  I  N  solution  could  be  used  lot  the  mi 
tial  iteration  to  the  largei  problem 

It  !■'  N  treated  -I  .  H’,  and  S  as  given,  and  determined  optimal  shipments  u  and  inven 
tones  vf  to  mininu/e  the  sum  of  the  latter  two  costs  over  the  horizon  t  -  I..’,  .  /  lot 
intermediate  values  of  /',  related  questions  are  how  the  prduclion  at  each  plant  should  be 
increased  over  time  and  how  the  shipping  patterns  should  change,  when  market  demands  have 
given  growth  rates  and  there  is  thus  an  increase  in  the  total  volume  handled  by  the  system 
Such  optimal  growth  paths  in  logistics  operations  have  been  studied  hv  Srimvasan  and  I  homp 
son  140|  when  supply  costs  are  linear  or  piecewise  linear  and  convex  for  still  largei  /.  I  ong 
and  Srimvasan  1 1 4|  have  considered  the  multiperiod  capacity  expansion  problem  when  market 
demands  arc  nondecreasing  over  tune  and  must  be  met  exactly  during  each  period  In  the  case 
of  linear  costs,  they  furnished  an  effluent  algorithm  to  schedule  capacity  expansions  and  ship 
ments  to  markets  to  minimize  the  discounted  capacity  expansion  costs  plus  the  costs  of  produc 
lion  and  transportation 
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3.3.  Maximal  Dynamic  Flows 


3.3.1.  Single  Commodity 


The  problem  of  maximal  dynamic  flow  requires  the  determination  of  the  largest  commo¬ 
dity  flow  v(D  which  can  occur  between  source  and  sink  within  a  specified  time  horizon  T.  A 
capacity  ct)  and  a  minimum  transit  time  r0  are  associated  with  each  arc  (ij).  Let  ujj  be  the 
flow  leaving  node  /  at  time  /,  enroute  to  node  j  ;  for  the  deterministic  case,  arrival  at  node  j  will 
occur  at  time  (r  +  r„).  Transshipment  is  allowed,  with  t/,'  denoting  the  commodity  inventory 
at  node  /  held  over  from  time  /  to  (r  +  1).  (It  turns  out,  however,  that  there  always  exists  a 
maximal  dynamic  flow  in  which  there  are  no  holdovers  at  intermediate  nodes.) 

Letting  P,  Q  denote  the  source  and  sink  nodes,  respectively,  the  problem  is  then: 


subject  to 


Maximize  v(D 


Z  Z  hb  ~  VV)  =  v(T) 

1-0  j 

z  H  -  Uj~T "]  =  0;  /  *  P.  Q-  t  -  0,  1 .  T 

j 

Z,  Z  ["«/  -  “jqTj01  =  -v(D 

1-0  j 

o  <  u(i  <  Cjj . 

This  problem  has  been  studied  in  detail  by  Ford  (15)  and  by  Ford  and  Fulkerson  (16], 
Their  analysis  is  based  upon  a  Time  Expanded  (T.E.)  network,  in  which  a  new  node  is  intro¬ 
duced  for  each  (discrete)  time  period.  The  maximal  dynamic  flow  is  obtained  by: 


1.  Solving  a  static  transshipment  problem  in  the  T.E.  network. 

2.  Repeating  this  flow  for  successive  time  periods. 

Ford  and  Fulkerson  showed  that  this  maximal  temporally  repeated  flow  is  the  optimal  dynamic 
flow. 


3.3.2.  Multicommodity 

Ford  and  Fulkerson's  result  cannot  be  extended  to  more  than  one  commodity,  that  is.  to 
the  case  where  the  upperbounds  c(/  refer  not  to  each  commodity  separately  but  rater  to  the  sum 
of  the  u,Jk  over  all  commodities  k.  Bellmore  and  Vemuganti  [3]  (B.V.)  have  used  the  fact  that, 
while  the  temporally  repeated  multicommodity  flow  need  not  be  optimal,  it  is  feasible.  This 
furnishes  a  lower  bound  on  the  maximal  dynamic  flow. 


B.V. 
nished  by 


showed  that,  as  the  number  of  time  periods  7  becomes  large,  an  upper  bound  is  fur- 

[Multi.  Comm.  Max.  Dyn.  Flow  for  T periods) 

-[Max.  temporally  repeated  flow  for  T periods)  ^  a. 
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The  number  a  is  independent  of  T,  so  that  for  large  T ,  the  percentage  difference  between  the 
two  terms  on  the  left  tends  to  zero.  B.V.  show  how  a  can  be  obtained  by  solving  the  mul¬ 
ticommodity  transshipment  problem  on  the  static  network. 

The  bounds  a  are  refined  by  B.V.  through  the  use  of  duality.  As  they  note,  their  method 
and  results  generalize  to  the  case  of  the  weighted ,  multicommodity  maximal  dynamic  flow  prob¬ 
lem.  Nevertheless,  as  far  as  known  to  the  present  authors,  no  exact  algorithm  has  been 
presented  for  the  problem  of  maximal  dynamic  flow  in  the  multicommodity  case. 

4.  A  STANDARD  MULTIPERIOD,  MULTIMODE  MODEL 

Our  next  illustrations  for  the  case  of  discrete  time  incorporate  uncertainty  in  the  demand 
requirements  Br  For  the  single  stage  problem,  treatments  by  Dantzig  [10],  Williams  [52]  and 
Szwarc  [43]  have  employed  stochastic  linear  programming. 

In  the  multistage  case,  the  probabilistic  version  of  the  transportation  problem  can  be 
thought  of  as  involving  demand  governed  by  a  stochastic  process.  The  paper  by  Midler  [34] 
considers  such  a  multiperiod  problem  with  random  demands  for  the  multicommodity  case.  A 
dynamic  programming  algorithm  is  used  to  decide  which  mode  of  transport  to  employ,  and  to 
assign  commodity  classes  to  various  modes  and  supply  points  to  destinations.* 

The  problem  is  formulated  as  one  of  discrete  time  stochastic  optimal  control,  in  which  the 
objective  is  minimization  of  total  expected  cost  over  the  planning  horizon.  The  costs  con¬ 
sidered  are  those  of  shipping,  by  commodity  and  route;  costs  of  rerouting  a  carrier  from  a  desti¬ 
nation  back  to  each  origin;  costs  of  stockouts  and  of  carrying  inventories  at  destinations;  and 
costs  involving  deviations  from  port  capacities.  Midlev  assumed  that  the  shipping  costs  are 
linear,  with  all  other  costs  quadratic. 

The  dynamic  equations  were  taken  to  be  linear  in  the  state  and  control  variables,  and  the 
model  solved  in  closed  form.  Midler  found  that  the  optimal  values  of  the  control  variables 
could  be  expressed  linearly  in  terms  of  the  current  values  of  the  state  variables,  with  the 
coefficients  in  this  relationship  depending  upon  the  number  of  periods  remaining  in  the  hor¬ 
izon.  Midler  then  furnished  an  extensive  discussion  of  the  solution  to  the  transportation  prob¬ 
lem,  including  the  optimal  dependence  upon  time  and  upon  location  of  the  assignment  of  each 
commodity  class  to  a  mode,  and  the  interpretation  of  the  time-varying  stochastic  shadow  prices. 

5.  SOME  APPLICATIONS 

We  now  discuss  two  problems  which  have  been  formulated  as  applications  of  one  form  or 
other  of  the  dynamic  transportation  problem  with  discrete  time. 

5.1  A  Multiperiod  Truck  Delivery  Problem 

The  first  application  concerns  the  scheduling  of  truck  deliveries.  This  problem,  studied  by 
Hausman  and  Gilmour  [25],  arose  in  the  analysis  of  delivery  of  home  heating  oil.  Deliveries 
are  made  from  a  single  source  to  n  customers,  each  of  which  has  a  minimum  required  fre¬ 
quency  of  service  f.  This  frequency  /,,  which  may  be  exceeded  to  take  advantage  of  truck 
economies  of  scale,  furnishes  the  time-dependence  of  the  problem  at  hand. 


*For  the  case  of  a  dynamic  communication  network  with  stochastic  inputs.  Scgall  and  Muss  |J7|  have  furnished  an 
analysis  (in  the  continuous-time  framework)  which  uses  Hamilton  Jacobi  theory  (see  eg  .  Fleming  and  Rishcl  IUM 
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As  an  operating  policy,  the  customers  are  classified  into  k  groups ,  where  k  is  treated  as  a 
decision  variable.  When  any  customer  in  a  group  needs  service,  the  whole  group  is  served. 
The  objective  function,  then,  is  the  assignment  of  each  customer  to  a  group,  so  that  annual 
delivery  costs  are  minimized. 

The  truck  costs  involve  both  a  fixed  cost  per  delivery,  as  well  as  a  variable  cost  per  mile 
travelled.  The  presence  of  this  variable  cost  means  that  evaluation  of  the  objective  function 
requires  knowledge  of  the  distance  travelled  in  the  optimal  tour  of  each  group.  This,  of  course, 
amounts  to  solving  the  travelling  salesman  problem ,  in  general  a  difficult  task  indeed. 

Hausman  and  Gilmour  took  the  point  of  view  that  the  customers  in  a  group  were  likely  to 
be  reasonably  close  geographically.  The  truck  driver  would  then  intuitively  choose  a  sequence 
of  deliveries  which  involved  a  distance  D  fairly  close  to  that  in  the  optimal  tour,  at  least  if  there 
were  not  an  "excessive”  number  of  customers  per  group.  (The  latter  is  guaranteed  by  the  lim¬ 
ited  capacity  of  the  truck.)  Accordingly,  these  authors  estimated  Dj  for  each  group  j  by  multi¬ 
ple  regression.  This  heuristic  device  employed  as  independent  variables  three  simple  statistics 
for  the  group  under  consideration.  These  variables  involved  the  standard  deviations  of  the  lati¬ 
tudes  and  longitudes,  and  the  number  of  customers  in  the  group.  Parameters  were  determined 
by  comparison  with  some  randomly  generated  travelling  salesman  problems  and  their  optimal 
solutions. 

Hausman  and  Gilmour’s  algorithm  began  with  this  approximation  D,  for  each  group,  fol¬ 
lowed  by  a  search  for  a  re-assignment  of  customer  groupings  in  an  attempt  to  lower  the  total 
costs.  While  the  problem  at  hand  is  quite  difficult,  their  algorithm  does  not  appear  to  be  very 
efficient,  because  customers  are  one-by-one  temporarily  re-assigned  to  each  of  the  other  groups. 

Nevertheless,  there  are  two  interesting  aspects  to  this  work.  The  regression  approach  (to 
estimate  Dj)  could,  as  Hausman  and  Gilmour  point  out,  be  useful  in  estimating  bounds  on  the 
optimal  solution  to  other  problems,  or  in  deciding  whether  to  continue  with  further  searches  on 
a  given  problem. 

Moreover,  in  the  course  of  their  work,  Hausman  and  Gilmour  found  that  they  could  still 
obtain  a  significant  cost  reduction  (and,  of  course,  considerable  saving  in  computer  time)  if  the 
area  under  consideration  were  partitioned,  and  these  sub-problems  treated  independently.  It 
would  be  worthwhile  to  obtain  conditions  on  the  validity  of  this  "geographical  decomposition." 

S.2.  Optimal  Rescheduling  of  Airplanes 

We  next  turn  attention  to  the  possibility  of  adjusting  a  previously-scheduled  flow,  when 
stochastic  disturbances  warrant  such  adjustment.  Ellis  and  Rishel  [11]  have  formulated  a  model 
for  the  one-way  flow  of  air  traffic  between  two  airports,  subject  to  random  constraints  on  the 
takeoff  and  landing  capacities.  Their  presentation  was  based  upon  the  state-space  approach  of 
(stochastic)  optimal  control  theory,  but  the  specific  example  which  they  solved  employed  a 
dynamic  programming  algorithm. 

Ellis  and  Rishel  considered  (n  -I-  1)  discrete  time  intervals,  by  the  end  of  which  all 
scheduled  takeoffs  from  airport  1  will  have  actually  taken  off  and  landed  at  airport  2.  Once 
takeoff  occurs,  (a  deterministic)  j  intervals  of  time  are  required  to  fly  from  1  to  2.  During  any 
time  interval,  the  number  of  aircraft  movements  which  a  controller  can  permit  is  limited  by 
runway  or  airside  capacity.  Each  capacity  was  treated  as  a  random  variable  to  model  delays  due 
to  weather  or  mechanical  failure. 
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The  objective  function  is  the  minimization  of  total  waiting  time,  with  time  in  the  air 
valued  higher  by  a  factor  a  >  1 ,  compared  to  waiting  time  on  the  ground.  The  control  vari¬ 
ables  are  the  numbers  of  landings  and  takeoffs  permitted  in  each  time  interval.  The  state  vari¬ 
ables  are  the  numbers  of  planes  currently  in  the  air;  aircraft  awaiting  takeoff;  those  awaiting 
landing;  and  the  current  capacities  of  the  respective  airports. 

To  ensure  with  probability  one  that  by  time  «,  the  schedule  of  movements  begun  at  time 
0  will  have  been  completed,  Ellis  and  Rishel  assume  that  there  exist  positive  upper  and  lower 
bounds  on  the  capacities  at  each  airport.  More  importantly,  the  completion  constraints  at  time  n 
were  shown  to  imply  constraints  on  the  values  of  variables  at  all  intermediate  times.  These 
implied  constraints  define  bounded  convex  sets  giving  the  feasible  values  of  the  state  variables. 
Ellis  and  Rishel  use  this  convexity  to  prove  a  theorem  which  shows  the  existence  of  an  optimal 
control,  and  which  recursively  exhibits  an  optimal  solution  to  the  dynamic  programming  prob¬ 
lem.  They  show  the  optimal  controls  for  a  5-period  numerical  example  which,  although  a  fairly 
simple  problem,  involved  considerable  computation. 

The  remainder  of  the  present  paper  deals  with  the  continuous-time  dynamic  transportation 
problem  with  delays.  In  particular,  the  following  section  deals  with  the  minimum-time  prob¬ 
lems,  as  well  as  minimum  cost  problems  with  delays.  Section  7  concludes  with  a  discussion  of 
some  important  extensions  of  the  dynamic  transportation  problem  with  delays. 

6.  DYNAMIC  TRANSPORTATION  PROBLEM  WITH  TIME  DELAYS 

An  important  class  of  extensions  of  the  classical  transportation  problem  recognizes  the 
fact  that  it  takes  time  for  a  shipment  to  reach  the  sink  after  it  has  left  the  source.  An  appropri¬ 
ate  framework  to  deal  with  such  problems  is  that  of  optimal  control  theory.  Frank  1171  was  first 
to  recognize  this  as  he  attempted  to  examine  a  communication  network  problem  with  finite 
delays  in  the  discrete-time  state-space  framework.* 

6.1.  Frank  (171  and  Frank  and  El-Bardai  (181 

To  describe  Frank’s  formulation,  we  let  G  denote  a  directed  graph  with  each  arc  or  branch 
6,  of  G  having  an  integer  (or  a  rational)  branch  delay  8,.  We  can  assume  8,  =  1,  for  all  i, 
without  any  loss  of  generality.! 

Let  z,(f)  represent  the  flow  in  b,  at  time  t  and  let  z (r)  =  (z|0),  z2{t),  ...,  zm(f)),  where 
m  denotes  the  number  of  branches,  be  called  the  flow  or  state  vector.  Further,  let  «,■(/)  be  the 

external  input  and  y7(/)  be  the  output  at  node  j.  With  u(t)  =  (ut(r),  u2(r) . u„(t ))  and 

y(t)  =  (yi(r).  y2U) . y„(f)),  where  n  is  the  number  of  nodes,  we  can  write  the  state  equa¬ 

tions  as: 

(l)  z(f  +  l)-/(zO).  mO)J. 

(Equation  (1)  continues) 


•Note  that  (he  first  application  of  optimal  control  theory  to  a  static  transportation  problem  is  due  to  Fan  and  Wang 
1121.  They  applied  a  discrete  maximum  principle  to  solve  a  nonlinear  transportation  problem  (of  Bellman  and  Dreyfus 
111)  with  two  sources  with  unlimited  supplies  available  at  increasing  (concave)  cost,  and  multiple  sinks  with  specified 
demands.  Charnes  and  Kortanek  [61  pointed  out  an  inconsistency  in  the  Fan-Wang  procedure.  The  reader  is  referred 
to  Halkin  1221  for  a  precise  statement  of  the  discrete  maximum  principle. 

t If  not,  we  can  construct  a  new  graph  (S'  by  replacing  the  i  th  branch  by  8,/8  series  branches  with  common  time  delay 
6,  where  6  is  the  greatest  common  divisor  of  all  8,.  Note  that  8,  in  turn,  can  be  considered  a  unit  of  time. 


flMNM 


IkMMNftNM 


and 

(1) 


DYNAMIC  TRANSPORTATION  PROBLEM 


75 


>-(/+  l)  =  /[z(f),  «(/)]. 


With  this  model,  Frank  relates  the  idea  of  satisfying  demands  at  the  nodes  of  the  network 
to  stale  reachability.  He  restricts  the  system  strategy  (/,  /)  to  be  conservative  linear.  Note  that 
(/,  /)  is  conservative  if  at  each  node  j,  total  inflows  equal  total  outflows,  and  (f,  f)  is  linear  if 
/=  Az(t)  +  Bu(t)  and  f  =  Cz(t)  +  Du(t). 

Frank  shows  that  a  given  final  state  z(T)  is  reachable  from  the  zero  state  in  time  Fif  and 
only  if  there  exists  a  state  a  >  z(T)  in  the  subspace  spanned  by  the  columns  of  the  controlla¬ 
bility  matrix  Q  given  by 

(2)  Q  =  IB,AB.A2B . At~'B]  . 

Furthermore,  z(T)  is  reachable  from  the  zero  state  (in  a  finite  time)  if  and  only  if  it  is  reach¬ 
able  from  the  zero  state  in  time  r,  where  r  is  the  degree  of  the  minimal  polynomial  of  A.  Frank 
goes  on  to  examine  the  necessary  and  sufficient  conditions  for  the  existence  of  a  linear  strategy 
under  which  a  given  set  of  terminal  states  are  reachable  (from  the  zero  state). 


Finally,  Frank  oulines  a  procedure  to  find  the  set  of  reachable  states  for  a  linear  system 
with  upperbounds  on  input  u(r),  i.e.,  u(t)  <  U ,  Vr.  Note  that  it  is  no  longer  possible  to  set  a 
uniform  upper  limit  r  on  the  number  of  time  periods  necessary  to  reach  a  given  state. 


Frank  and  El-Bardai  [18]  also  impose  upperbounds  on  the  flows  in  branches  of  the  net¬ 
work.  For  this,  let  c  =  (cw  c2 . cm)  denote  the  branch-capacity  vector.  We  can  now  express 

the  linear  strategy  with  capacity  constraint  as: 

z(/  +  1)  =  minMz(f)  +  Bu(t),c] 

(3)  and 

y(t  +  1)  =  Cz(t)  +  Duit). 

Frank  and  El-Bardai  show  that  with  the  input  constraint  u(t)  <  (/,  the  set  of  reachable  states 
(in  time  T )  in  system  (3)  is  given  by 

d\d  <  £  d(t) 

i- 1 

where 

d(  1)  =  min[fl(/,c] 


and 


d(t  +  1) 


min 


Ad(t),c  — 


£  d(r) 

T-l 


t  =  1,2 . (T  —  1).* 


'Frank  and  El-Bardai  also  derive  a  similar  result  for  systems  with  memory •  (but  without  input  constraints).  That  means, 
real  number  Wj  >  0  can  be  associated  with  node  j,  representing  the  maximum  amount  of  flow  that  can  be  stored  at 
node  j.  Note  that  any  excess  flow  to  be  stored  at  node  j  will  be  lost.  Obviously,  if  a  state  is  reachable  in  a  memoryless 
system  (i.e.,  one  in  which  no  flow  can  be  stored),  it  is  reachable  in  a  system  with  memory.  The  converse  is  not  neces¬ 
sarily  true. 
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So  far,  we  have  dealt  with  the  question  of  state  reachability  which  is  the  same  as  the  ques¬ 
tion  of  feasibility  in  operations  research.  The  answer  involved  applying  a  large  enough  input  in 
every  period  to  saturate  all  branches.  It  may  be  that  a  given  demand  vector  can  be  satisfied  in 
time  T  without  saturating  all  branches.  This  raises  the  question  of  optimality.  The  idea  is  to 
find  the  ‘smallest'  inputs  with  which  a  demand  vector  may  be  satisfied. 

The  problem  of  finding  such  an  input  sequence  can  be  easily  formulated  as  a  linear  pro¬ 
gramming  problem.  That  is,  given  z(0)  and  parameters  A,  B,  zf ,  C  and  U,  find  u(t)  and  z(f) 
that  minimizes 

I  ±uj(t). 

i-0  j- 1 

subject  to 

z(t  +  1)  <  Az(t)  +  Bu(t)  t-  0.1,2 . (T-l) 

zO)  >  zf 

0  <  z(t)  <  c,  t  -  1,2 . T 

0  UfO)  <  U,  j  -  1.2.  ....  n,  t  =0,1,2 . (T  -  1)  . 

The  solution  of  this  linear  programming  problem  will  yield  a  sequence  of  inputs  that  takes  the 
state  z(0)  of  G  to  a  state  z(T)  ^  zf  in  a  way  that  is  optimal  in  the  input  sense.  An  important 
property  of  this  sequence  is  that  it  is  also  optimal  in  the  loss  sense ,  i.e.,  it  minimizes  loss  in  the 
network. 

It  is  the  idea  of  optimality  with  which  we  are  really  concerned  in  this  survey.  Tapiero  and 
Soliman  [51]  address  themselves  to  this  important  problem  of  optimality  in  dynamic  transporta¬ 
tion  networks  with  transport  delays.  They  use  the  continuous-time  state-space  framework  and 
the  maximum  principle  for  their  analysis.* 

6.2.  Tapiero  and  Soliman  |5l!  and  Tapiero  1491 

Tapiero  and  Soliman  formulate  a  dynamic  multi-commodity  transportation  problem  with 
time  delays  as  an  optimal  control  problem.  For  simplicity  in  exposition,  we  will  only  develop 
the  single-commodity  case  formulated  in  Tapiero  [49]  since  its  extension  to  the  multi¬ 
commodities  case  is  rather  straightforward;  all  our  discussion  addressed  to  the  single¬ 
commodity  case  is  applicable  to  the  multi-commodity  version. 

6.2.1.  The  Model 

To  develop  the  model,  let  there  be  m  sources  and  n  destinations.  Let  /  denote  source  / 
and  j  denote  sink  j.  Let  t  denote  the  time.  We  can  now  define  the  following  variables  and 
parameters: 

Uij(t)  =  rate  of  flow  from  source  /  enroute  to  j  at  time  r, 
x,(t)  =  inventory  at  source  /  at  time  r, 


•While  we  survey  only  his  latter  papers,  Tapiero's  1971  paper  is  the  first  paper  applying  the  maximum  principle  to  the 
dynamic  transportation  problem.  The  paper  considers  the  problem  without  delays.  We  also  note  that  more  recently 
Segall  and  Moss  [37]  have  dealt  with  a  similar  problem  for  a  communication  network. 
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yj(t)  -  inventory  at  sink  j  at  time  I, 

Ty  *  transit  time  from  /  to  j, 

Cy  —  maximum  flow  capacity  of  route  Uj), 

K  —  total  flow  capacity  of  the  transport  system, 

T  —  planning  horizon, 

A,  —  initial  supply  at  source  / ;  i.e.,  x,(0)  -  Ah 

Bj  »  demand  at  sink  j  at  time  T ;  i.e.,  yj  ( T)  -  Bj, 

L[x,(t).  yj(t),  uy(t),  /}  -  cost  function  expressing  inventory  and 

transportation  costs. 

Note  that  it  is  necessary  to  have  the  total  available  supplies  exceed  or  equal  the  total  available 
demand.  We  may  assume  (without  any  loss  of  generality) 


(4) 


m  n 

I  A,-  I  Bj. 

.-I  i- i 


We  can  now  state  the  optimal  control  problem  as  follows: 


(5) 


minjy  -  LUtt),  yj(f).  uy(t).  t }  rfrj , 


subject  to 

(6)  - 

x,(t)  -  £  «</(*),  x,(0)  -  A,,  x,(T)  -  0,* 

J- 1 

(7)  m 

fijit)  -  £  tyO  -  Ty).  yj(s)  -  0,  Vs  <  0;  y,(r)  -  BJt 

i 

and  the  capacity  constraints 

(8) 
and 
(9) 


I- 1 


0  <  U(y(l)  ^  Cy 


0<  tt  UyO)  <  K. 
/-i  y-i 


•Note  that  the  terminal  condition*  of  x,  can  be  rewritten  as  jt,(T-min  ru)  -  0.  We  note  that  Tapiero's  formulation  has 

misprints  in  the  specification  of  terminal  conditions  of  x,  and  y,.  In  passing,  we  may  also  remark  that  there  are  other 
errors  in  Tspiero  (49)  snd  Tapiero  and  Soliman  (SI). 
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Note  that  x(  and  are  the  state  variables  and  ut]  are  the  control  variables  in  the  terminology  of 
optimal  control  theory.  We  refer  to  (9)  as  a  linking  constraint  since  it  introduces  interdepen¬ 
dencies  among  flows  on  various  routes. 

It  is  convenient  to  transform  this  formulation  into  an  equivalent  formulation  by  defining  a 
new  variable  ijy(f)  denoting  the  cumulative  flow  (up  to  time  /)  which  has  left  source  /  for  sink 
j.  The  equivalent  problem  is: 

fQ  A  VyO),  tldtj.* 

.subject  to 

VyO)  “  «</0).  t|//(i)  -  0,  V5  6 

with  constraints  on  terminal  conditions 

X  Vij(T)  -  X  ViAT~7 ij)  -  A„ 

i- i  J- i 

X  «  £  Vij(T-t0)  -  Bj, 

i-l  i-l 

and  the  control  constraints  (8)  and  (9). 


(12) 

(13) 


6.2.2.  Application  of  the  Maximum  Principle 

We  are  now  ready  to  apply  the  maximum  principle  with  delays  in  the  state  variables  (see 
Kharatishvili  [30];  see  also  Ray  and  Soliman  [36]).  The  Hamiltonian  is: 

(14)  m  n 

H  -  -\{r\iJU),r\ijU-rij),  uu(t),t)  +  £  £  X(>(f)  utf(f) 

/-I  j- i 

where  the  adjoint  variables  X,y(r)  satisfy 

(15)  i  r,\  flA(/}  9AIi+t0  ) 

r  dvijO)  dViJU ) 

with  the  transversality  conditions  (see  Tapiero  and  Soliman  [51]  and  Tapiero  [49]) 


(16)  \jj(T  —  t ij)  +  t mn)  =  \mj(T ~ t mj)  +  \in(T—  r jn),  /-  1,  2 . (m  —  1)  , 

j  -  1,2,  . . .,  («  -  1). 

It  is  noted  that  the  adjoint  variables  in  optimal  control  play  a  rote  similar  to  the  Lagrange  multi¬ 
pliers  in  nonlinear  programming.  For  explanation  of  the  transversality  conditions,  see  Fleming 
and  Rishel  113],  Tapiero  [50],  and  Sethi  and  Thompson  [39]. 


•Note  that  *,(/)  -  A:  -  X  ’!«(/)  and  >y(/)  ”  X  and,  therefore,  the  new  loss  function  (denoted  in  ab- 

i-\  r-1  n  m 

breviated  form  by  A|r|)  is  obtained  as  A{r  I  -  IAA,  -  £  ■»>,/</).  £  i}uU  -  r,,).  uu(i).  n  . 

I  /-I  t-1  I 
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A  necessary  condition  for  optimal  transportation  is  that  the  Hamiltonian  be  a  maximum 
(subject  to  control  constraints  (8)  and  (9))  along  the  optimal  path.  This  is,  in  general,  a 
difficult  problem  to  solve.  However,  it  may  be  possible  to  resolve  the  problem  in  some  special 
cases.  Tapiero  takes  up  the  minimum  time  problem  as  the  special  case  for  consideration. 

6.2.3.  Minimum-Time  Problem 


The  minimum-time  transportation  problem  is  concerned  with  determining  a  transport 
schedule  which  will  transport  supplies  at  m  sources  to  meet  the  demands  at  n  sinks  in  minimum 
time.  This  problem,  especially  its  multi-commodity  version,  is  extremely  important  in  army 
logistics,  in  supplying  help  to  disaster  areas  and,  possibly,  in  transporting  perishable  goods.  For 
the  case  of  discrete  time,  we  have  discussed  this  problem  in  Section  3. 


For  minimum-time  problems  the  objective  function  (10)  is  the  total  time  T,  thus 


That  means  \u  -  0  in  (15),  implying  that  the  A„  are  mn  (unknown)  constants  satisfying 
(m  -  1)  (n  -  1)  linear  equations  (16).  With  this  information,  we  can  rewrite  the  Hamiltonian 
as:  where 


(18) 


m  n 

//  — l  +  I  I  *</  «(,(/). 

i-i  j-  i 


Since  it  is  easier  to  deal  with  the  minimum-time  problem  without  the  transport  system’s 
capacity  constraint  (9),  we  will  take  this  up  next. 


6.2.4.  Minimum-time  Problem  Without  (9) 


Since  the  Hamiltonian  (18)  is  linear  in  uu  and  since  the  only  constraints  we  need  to  worry 
about  are  the  route  capacity  constraints  (8),  which  are  not  of  the  linking-type,  it  is  easy  to  see 
that 


(19) 


where 


k,i  <  0  —  u„(t )  =  0  .  V/. 


[«,/(/)  “  c,/.  i  < 

lo  .  otherwise. 


(20) 


This  means  that  along  any  arc  (ij),  either  there  is  no  shipment  or  there  is  a  shipment  at 
full  capacity  from  time  zero  to  some  time  t0  <  T.  This  observation  allows  us  to  set  up  a  linear 
program  which  will  solve  the  minimum-time  problem  without  (9): 
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Min  T 

subject  to 

‘ij  <  T  -  TU>  '• 

n 

£  ‘'ij  ‘ij  =  /^/> 

j- 1 

(21) 

m 

I  Cyty=  Bj.  V/. 

/-] 

ty  >  0,  V/J. 

This  L.P.  problem  will  give  the  optimal  ty  and  T  provided  there  is  a  feasible  solution  to 
the  given  problem.  We  remark  that  in  the  optimal  solution,  T  -  min  Uy  +  Ty). 

Note  that  it  may  not  be  easy  or  even  possible  to  modify  this  program  to  take  into  account 
(9)  and  still  retain  an  L.P.  problem.  The  existence  of  other  ways  to  find  an  optimal  solution 
with  (9)  is  our  next  concern. 

6.2.5.  Minimum-Time  Problem  with  (9) 

If  the  linking  constraint  (9)  is  binding,  there  will  be  some  kind  of  priority  assignment  in 
the  set  of  routes  (ij).  Obviously,  the  Hamiltonian-maximizing  condition  will  imply  that  this 
priority  assignment  be  reflected  in  the  values  of  k0. 

Suppose  that  we  have  a  given  set  of  values  for  A y  which  satisfy  (16).  There  are  many 
possible  sets  that  may  do  this.  Note  that  ky  3  1,  ViJ  always  satisifies  (16).  Given  the  set  of 
values  for  A,,,  the  Hamiltonian  maximization  is  a  knapsack  problem,  in  fact  a  parametric  knap¬ 
sack  problem.  A  tentative  outline  of  this  procedure  is  suggested  below;  see  Sethi  and  Book¬ 
binder  [38]  for  more  details. 


We  begin  by  listing  positive  A y  in  decreasing  order.  Starting  with  the  route  associated 
with  the  largest  Atf,  we  go  down  the  list  by  shipping  the  maximum  flow  c0  in  the  associated 
route  (i,j)  until  all  routes  with  positive  A u  are  full  or  the  sum  of  the  flows  equals  or  just 
exceeds  the  total  system  capacity  K.  If  it  exceeds  K ,  we  cut  back  the  flow  in  the  last  route  just 
enough  so  that  the  sum  of  the  flows  equals  K. 

The  flows  defined  above  commence  at  time  /  =  0  and  continue  until  the  first  t,.  defined  in 
(20)  is  encountered.  Let  this  be  r,^.  This  ti^]  can  be  of  one  of  two  types  depending  upon 
whether  the  first  or  the  second  argument  in  (20)  is  smaller. 

a)  If  ti)J]  is  of  the  first  type,  set  all  u, =  0,  Vr  >  r,  ■ 

b)  If  t,iJ]  is  of  the  second  type,  set  all  uUl  =  0,  Vr  >  t,  j . 

We  then  go  back  to  assigning  the  flows  according  to  the  knapsack  procedure  by  making  a 
new  list. 

These  iterations  continue  until  no  further  flow  can  be  assigned.  At  this  point,  there  are 
two  possible  situations.  Either  the  terminal  conditions  (12)  and  (13)  (equivalent  to  terminal 
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conditions  in  (6)  and  (7))  are  met  in  which  case  we  term  the  set  of  A,,  a  feasible  set,  or  they 
are  not  met,  in  which  case  the  set  of  A(>  is  clearly  infeasible. 

If  infeasible,  we  have  to  get  another  set  of  A,,  satisfying  (16)  and  start  over  again.  If 
feasible,  we  have  obtained  a  local  minimum.  It  may  not  be  possible  to  obtain  the  global 
minimum  since  this  may  require  obtaining  all  the  local  minima,  of  which  there  could  be  a  large 
number. 

To  conclude  this  section,  we  note  that  it  would  be  more  realistic  to  replace  the  total  sys¬ 
tem  constraint  (9)  by  loading  and  unloading  constraints, 


UjjU)  <  b„  V/, 


Uij(t)  <  cJt  Vr. 

be  easily  modified  to  deal  with  (22)  and  (23). 


Tapiero  and  Soliman  [51]  have  treated  the  multicommodity  version  of  the  minimum-time 
problem.  Their  paper  does  not  contain  a  proof  of  their  algorithm  It  should,  however,  be 
noted  that  in  the  multicommodity  case,  an  essential  feature  of  the  problem  is  the  linking  con¬ 
straint.  To  state  the  linking  constraint  (the  analogue  of  (9)),  let  there  be  r  commodities  and  let 
subscript  k  denote  the  klh  commodity.  The  linking  constraint  can  be  stated  as 

(24) 

£  <*kuiJkU)  <  Q.  Vf, 

*-i 

where  ak  is  the  capacity  required  per  unit  of  commodity  k.  These  constraints  are  analogous  to 
(9)  in  so  far  as  their  effect  in  developing  an  algorithm  is  concerned. 

We  offer  the  following  remarks  concerning  (24).  The  constraint  states  that  the  total  ship¬ 
ment  leaving  source  /  at  time  t  enroute  to  sink  j  should  not  exceed  Cv.  However,  the  loads-in- 
transit  at  any  point  along  arc  (ij)  need  not  be  less  than  Q.  An  example  will  further  clarify 
this  issue.  Suppose  there  is  a  bridge  along  arc  U,j)  which  is  a  bottleneck  and  whose  capacity  is 
Cij.  Imposition  of  constraint  (24)  does  not  guarantee  that  at  some  time  later  than  /,  there  will 
not  be  a  load  on  the  bridge  in  excess  of  Qj.  Different  commodities  k  may  require  different 
times  TjJk  to  travel  from  source  i  to  sink  j,  and  it  is  possible  for  them  to  leave  the  source  /  satis- 
ying  (24)  and  yet  exceed  C,y  on  the  bridge.  Tapiero  and  Soliman  have  not  recognized  this  in 
stating  the  arc-capacity  constraints  (24).  Once  again,  constraints  (24)  are  more  like  loading 
constraints  associated  with  arc  (ij)  rather  than  the  arc-capacity  constraints.  Note  also  that  the 
arc-capacity  constraints,  such  as  in  the  case  of  the  bridge  situation  above,  will  be  extremely 
difficult  to  handle. 


£ 


(22) 

and 

(23) 

m 

o<  £ 

i-i 

The  algorithm  suggested  in  this  section  can 

6.2.6.  The  Multicommodity  Case 
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6.2.7.  Linear  Inventory  and  Transportation  Costs 

Tapiero  and  Soliman  [51]  have  also  considered  the  problem  wherein  the  inventory  and 
transportation  costs  are  assumed  linear.  In  this  case,  the  loss  function 


(25) 


L  =  £  a,x,  +  £  dj)>j  +  £  £  q,j  Uy, 
/-i  j—  i  j- 1  /-I 


where  a,  and  dj  are  the  costs  per  unit  time  of  holding  a  unit  of  inventory  at  source  /  and  at  sink 
j,  respectively,  and  qt]  is  the  cost  per  unit  time  of  transporting  the  commodity  at  a  unit  rate.  In 
terms  of  j)y,  we  can  write  the  loss  function 


A{t|  =  £  a\A,  -  £  Vij(t)]  +  £  £  djUtjil  -  Ty) 
1-1  /-I  /-!  /-I 


(26) 


+  Z  Z  du  “,,(')• 

j- 1  1-1 


The  Hamiltonian  can  be  written  as 


(27) 


m  n  n  m 

H - £  a,lA,  -  £  r)ij(t)]  -  £  £  d^yO  -  tu) 

/-I  j- I  j-\  i- 1 

+  Z  Z  l-dij  +  ^y(/)]“i>('). 

j- 1  i- 1 


where  the  adjoint  variables  X,7(r)  can  be  expressed  as 


'  '  Xiy(r)  =  \y(T  -  t ij)  +  (a,  -  dj)(T  -  r y  -  t). 

with  \jj(T  -  Ty)  satisfying  (16).  Furthermore,  it  is  obvious  that 

(29)  a i  >  dj  — *  \jj (t)  decreases  with  r, 

a/  <  dj  — *  kjj(t)  increases  with  1. 


But  the  Hamiltonian  (27)  is  linear  in  uu,  implying  that  the  optimal  control  is  bang-bang. 
From  (29),  we  can  conclude  that  for  the  problem  without  (9),  the  form  of  the  optimal  policy 
for  each  are  (ij)  can  be  characterized  by  one  number  ti}  if  a,  >  d,  and  t0  if  a,  <  d, : 


(30) 


Qj  >  dj  —  Uyit) 


Cjjt  0  <  /  <  ty 

0,  otherwise, 


dj  <  dj  —  UyU) 


0,  0<  /  <  ty 

^ij »  ty  ^  t  ^  T  T  y . 


This  policy  is  consistent  with  our  intuition.  It  states  that  if  the  inventory  cost  at  source  / 
is  higher  than  that  at  sink  j,  then  shipment  along  arc  ( ij ),  if  any,  must  commence  as  early  as 
possible.  On  the  other  hand,  if  a,  <  ds  then  delay  the  shipment  along  arc  ( ij ) ,  if  any,  as  long 
as  possible. 
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Having  characterized  the  form  of  the  optimal  policy,  it  is  possible  to  formulate  a  quadratic 
program  to  solve  the  linear -costs  transportation  problem  without  (9) .  This  is  done  in  a  manner 
similar  to  that  in  Section  6.2.4.  We  note  that  Tapiero  and  Soliman  (51]  had  earlier  made  this 
observation.  For  the  actual  formulation  of  the  quadratic  program,  see  Sethi  and  Bookbinder 
[38].  Finally,  Sethi  and  Bookbinder  also  attempt  to  obtain  an  algorithm  for  the  problem  with 
(9).  This  is  similar  but  far  more  difficult  than  the  procedure  in  Section  6.2.5. 

6.2.8.  Linear  Inventory  and  Quadratic  Transportation  Costs 

Linear-quadratic  control  problems  have  a  special  place  in  the  optimal  control  theory.  Usu¬ 
ally  these  problems  yield  closed-form  solutions.  Midler  [34]  treated  a  transportation  problem 
with  linear  transportation  costs  and  quadratic  inventory  costs  in  the  stochastic  dynamic  pro¬ 
gramming  framework.  Tapiero  and  Soliman  [51]  treat  a  similar  problem  with  delays  in  the 
deterministic  framework,  but  with  linear  inventory  costs  and  quadratic  transportation  costs.  To 
state  the  latter  problem,  we  only  need  to  replace  the  q,j  u u  term  in  (25)  by  q^iuj  -  u,j)2,  where 
My  is  the  most  desirable  shipment  rate  from  the  point-of-view  of  transportation.  Furthermore, 
constraints  ( 8 )  and  (9)  are  assumed  to  be  no  longer  acting  for  this  problem. 

The  Hamiltonian  of  this  problem  can  be  written  as 

m  n  n  m 

H  =  -  Z  ~  Z  1  'I  I  djVij(t  ~  rij) 

/-I  7-1  7-1  i-l 

n  m  n  m 

’  ~  L  Z  9,i(u,i(t)  -  U,i)2  +  jr  j^AyWyU), 

;-i  /-I  7-1  <«i 

where  the  adjoint  variable  \y(/)  satisfies  (28)  and  (16).  Furthermore,  the  Hamiltonian  maxim¬ 
izing  condition  yields 

U*j(t)  =  My  +  \y(r)/2<7y, 

where  u*uU)  is  the  optimal  control  if  m*  (t)  >  0,  Vr.  We  will  assume  this  to  be  the  case;  oth¬ 
erwise  the  problem  becomes  more  difficult. 

We  can  now  use  (32)  in  (11)  to  obtain  tu)  which  must  satisfy  (12)  and  (13). 

These  conditions  can  be  simplified  as: 


uj;  „  r_T. 

Z  — ~  -  t  jj)  +  (a,  -  dj)(T  -  t  y )/  2  ]  =  Ah  i  =  1,  2 .  m, 

7- 1  i(lij 
/‘J/I'l  m  T-t 

1  Z  — ~  [Xy(7--Ty)  +  (m,  -  dj)(T-  Ty)/2]  =  Bj,  j-  1.2 . (n  -  1)  . 

We  have  not  written  (34)  for  j  =  n  since  it  is  redundant  on  account  of  condition  (4),  which 
states  that  total  demand  equals  total  supply. 


The  system  of  Equations  (33)  and  (34)  contains  m  +  n  -  1  equations.  Along  with  (16) 
which  contains  (m  -  1)  (n  -  1)  equations,  we  have  a  total  of  mn  equations  which  must  be 
satisfied  by  mn  variables  Ay (T  -  T,y).  The  solution  of  this  linear  system  of  equations  should 
provide  the  optimal  control  when  substituted  in  (32). 
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7.  SUGGESTED  IMPORTANT  EXTENSIONS 

A  natural  extension  of  the  dynamic  transportation  problem  with  delays  involves  time- 
dependent  demands  at  various  sinks.  If  we  let  (jU)  represent  the  demand  rate  at  sink  j,  then 
(7)  becomes 


(35) 


y,  -  L  ««<'  -  To> 

/-i 


with  an  additional  constraint  that 


yjU)  >  0.  Vr. 

For  this  extension,  it  may  be  necessary  to  either  assume  ^(0)  >  0  or  £,(/)  -  0  for 
0  ^  t  ^  min  T..  to  have  feasibility.  Another  way  to  handle  this  problem  is  to  allow  shortages 

i 

and  incorporate  shortage  costs  in  the  objective  function.  See  also  Midler  |34)  in  this  connec¬ 
tion. 


A  simple  and  interesting  extension  is  the  case  of  perishable  goods.  If  we  assume  a  con¬ 
stant  spoilage  rate  y  in  transit,  it  amounts  to  replacing  (7)  by 

(37)  ..  ,  J5,  ,  ,  -yr„ 

y(t)~  X  u0(t-T0)e  u  . 

i- 1 

In  this  case  we  cannot  require  condition  (4),  since  that  would  imply  infeasibility.  Conse¬ 
quently,  feasibility  also  becomes  an  issue  in  this  case.  Of  course,  the  cost  of  spoilage  must  be 
incorporated  in  the  objective  function  by  modifying  unit  transportation  costs  on  the  arcs  of  the 
transportation  network. 

Another  extension  is  the  case  in  which  there  are  unloading  delays  at  various  sinks.  Typi¬ 
cally,  the  unloading  delay  at  a  sink  will  depend  on  the  inventory  at  the  sink  and  (possibly)  the 
time  of  unloading.  The  latter  dependence  may  derive  from  an  expansion  program  which  is  in 
progress  at  the  sink  under  consideration.  For  the  single  commodity  case,  this  amounts  to 
replacing  (7)  by 

•  yjU)  -  UfjO  -  Tj(yj(t),r)) 

i-i 

where  the  delay  r ,  may  be  assumed  to  satisfy  dr/d.v,  >  0  and  br  Jbt  <  0.  Ray  and  Soliman 
(36)  have  a  weak  maximum  principle  for  dealing  with  systems  subject  to  equations  of  the  form 

(38) . 


Perhaps  the  most  important  extension  is  the  case  in  which  the  time  required  lor  a  ship¬ 
ment  to  reach  from  source  i  to  sink  j  is  a  function  of  the  amount  shipped.  That  is 

Ty-Tyfu),  drJdu^'Z  0. 

In  this  case  the  transit  time  increases  with  the  amount  shipped.  For  the  single  commodity  case, 
this  amounts  to  replacing  (7)  by 

yj(t)-  f,  £  «(/«». 

(-1  I#  I*  +  T,  >1— f  I 


(40) 
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We  note  that  this  equation  is  an  essential  feature  of  any  dynamic  transportation  problem  in 
which  the  transit  time  is  a  function  of  the  load-in-transit. 

Equation  (40)  is  a  state  equation  in  which  not  only  the  delays  depend  on  controls  at  previ¬ 
ous  times  but  also  these  delays  are  defined  recursively.*  We  do  not  know  of  any  maximum 
principle  allowing  even  for  delays  depending  upon  controls  at  an  earlier  time,  let  alone  where 
that  time  is  defined  recursively.  Ray  and  Soliman’s  [36]  paper  contains  only  a  weak  maximum 
principle  for  systems  with  delays  depending  on  state  and  control  at  time  t  and  time  t. 

Treatment  of  state  equations  of  the  type  (40)  will  obviously  require  some  theoretical 
developments,  including  a  suitable  maximum  principle.  For  the  time  being,  we  are  working 
with  a  discrete-time  version  of  (40),  employing  dynamic  programming  and/or  some  theory  for 
optimization  systems  developed  by  Clarke  [8]. 

Finally,  the  problem  could  be  extended  to  take  into  account  the  stochastic  nature  of  tran¬ 
sit  delays.  This  would  require  replacing  (7)  by  a  stochastic  differential  equation.  One  could 
also  consider  demands  at  sinks  to  be  stochastic  (see  Midler  [34]  and  Segall  and  Moss  [37]). 
Either  case  results  in  difficult  stochastic  optimal  control  problems.t 
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ABSTRACT 

This  paper  presents  an  application  of  a  method  for  finding  the  global  solu¬ 
tion  to  a  problem  in  integers  with  a  separable  objective  function  of  a  very  gen¬ 
eral  form.  This  report  shows  that  there  is  a  relationship  between  an  integer 
problem  with  a  separable  nonlinear  objective  function  and  many  constraints  and 
a  series  of  nonlinear  problems  with  only  a  single  constraint,  each  of  which  can 
be  solved  sequentially  using  dynamic  programming.  The  first  solution  to  any  of 
the  individual  smaller  problems  that  satisfies  the  original  constraints  in  addi¬ 
tion,  will  be  the  optimal  solution  to  the  multiply-constrained  problem. 


INTRODUCTION 

Let  us  define  a  nonlinear  integer  programming  problem  using  the  following  notation: 
(1)  Max  2  =  £  ./}(*,) 

j- 1 


(2) 

such  that  It,  (x)  < 

0  /  -  1.2,  .. 

.  ,  m 

(3) 

Xj  €  lp 

j  =1.2.  .. 

.  ,  M 

where  fJ:  Ip  —  Rp 

j=  1,2,  .. 

•  ,  n, 

and  hr.  I„  —  Rp 

1=1,2,  ... 

,  m  and  x  =  (x\,x2t  .. 

•  »  xn). 

Additional  assumptions  are  as  follows: 


1.  The  functions/}  (•)  satisfy  a  necessary  conditon  for  dynamic  programming. 

2.  The  region  defined  by  (2)  and  (3)  is  non-empty,  with  at  least  one  lattice  point  in  the 
region  and  the  region  is  bounded. 

In  [1]  and  [2]  algorithms  are  developed  to  find  candidate  lattice  points  on  a  series  of 
hypersurfaces.  A  dynamic  programming  technique  is  used  to  solve  a  sequence  of  problems 

PRECEDING  PAGE  BLANK  -  NOT  IlLfcED 


90 


M.W.  COOPER 


with  only  a  single  constraint.  Each  of  these  problems  corresponds  to  a  value  of  the  objective 
function,  and  this  value  is  reduced  from  an  upper  bound  until  a  feasible  lattice  point  is  found. 
In  [1]  the  value  of  the  objective  function  is  reduced  by  one  unit  for  each  successive  single  con¬ 
straint  problem.  In  [2]  an  argument  is  made  for  allowing  reduction  of  the  trial  value  by  more 
than  a  unit  amount  using  numbers  corresponding  to  hyperplanes  with  integer  solutions  in  the 
dynamic  programming  return  function  tables.  Therefore,  only  hypersurfaces  which  contain  lat¬ 
tice  points  are  investigated  and  the  speed  of  the  algorithm  is  greatly  improved.  The  effort  con¬ 
tained  in  this  paper  is  an  explication  of  the  algorithm  developed  in  [2)  and  its  application  to 
problems  with  a  different  type  of  objective  function:  namely,  those  for  which  the  separable 
functions  are  such  that 

fj  (  ):  Ip~*  RP  j  =  1,2,  ...  n. 

(It  was  previously  assumed  that  (•):  Up  —  lp,  j  -  1,2 . n).  Such  problems  would  typi¬ 

cally  have  an  objective  function  like  z  =  x\n  -I-  3x2l/4  -I-  4x32.  It  will  be  shown  that  the  algo¬ 
rithm  of  [2]  can  be  used  on  problems  where  the  objective  function  is  not  restricted  to  integers. 
The  algorithm  will  find  a  global  optimum,  even  for  these  less  restricted  problems.  Therefore, 
this  method  can  be  used  for  nonlinear  problems  such  as  the  concave  transportation  problem  or 
for  discrete  problems  in  which  the  objective  function  is  neither  concave  nor  convex. 

Summary  of  the  Algorithm 

Let  us  formulate  the  following  approach  for  solving  the  problem  described  in  (l)-(3).  In 
this  development,  we  will  consider  a  sequence  of  hypersurfaces  of  the  following  form: 

(4)  £  fj(xj)  -  zk,  A  =  0,1,... 

j- 1 

If  we  can  find  an  upper  bound  z0  on  the  optimal  solution,  then  we  search  the  hypersurfaces  (4) 
corresponding  to  a  sequence  of  values  of  zk  for  lattice  points.  We  choose  z0  so  that  z0 
corresponds  to  an  upper  bound  on  the  value  of  the  objective  function,  and  every  successive 
value  of  zk  is  such  that  zk  <  z0,  and  the  hyperplane  (4)  corresponding  to  zk  contains  an  integer 
point.  We  will  show  how  this  is  done  in  the  following  sections.  The  search  for  lattice  points  on 
these  hyperplanes  is  performed  by  a  dynamic  programming  algorithm.  Any  lattice  points  so 
found  are  only  candidates  for  the  optimal  solution  to  (l)-(3).  They  must  be  tested  finally  for 
feasibility  with  the  constraints  given  in  (2)-(3).  Using  this  method  of  examining  all  lattice 
points  which  correspond  to  some  hyperplane  (4),  then  testing  for  feasibility  with  the  original 
constraints  (2)-(3),  and  terminating  as  soon  as  a  feasible  lattice  point  is  found  is  the  sequence 
of  steps  of  the  algorithm  given  in  [1]  and  improved  in  [2],  The  method  in  both  references 
looks  at  all  the  lattice  points  corresponding  to  one  hypersurface  at  a  time  and  checks  any  points 
found  for  feasibility.  The  improvement  given  in  [2]  is  that  the  sequence  of  values  zk  can  be 
calculated  so  that  only  those  hypersurfaces  are  examined  that  are  known  to  contain  lattice 
points.  Since  the  sequence  of  values  zk,  A=0, 1,  ...  ,  is  strictly  decreasing,  and  if  we  have 
examined  all  possible  surfaces  that  contain  lattice  points  and  check  these  points  for  feasibility 
with  the  original  constraints  (2)-(3),  then  we  can  guarantee  that  the  first  feasible  lattice  point 
will  be  optimal.  In  the  special  case  in  which  some  of  the  constraints  are  separable  it  is  possible 
to  discard  or  fathom  partial  solutions  thereby  improving  the  speed  of  the  algorithm.  This  is 
done  by  calculating 

(5)  .  A 

^  -  b,  -  £  a,j  xj 

j~n-l 


for  each  partial  solution  x„,  x„_|,  ...  ,  x,.  If  s,  <  0  for  any  i,  i  -  1 . m,  the  partial  solu¬ 

tion  will  only  lead  to  infeasible  solutions  and  can  be  dropped. 
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This  procedure  is  able  to  use  several  advantageous  properties  of  dynamic  programming. 
First  we  may  find  global  optima  for  nonlinear  functions  and  identify  integer  points  efficiently. 
Second,  it  is  possible  to  extend  the  algorithm  to  handle  even  nonseparable  constraints  (as 
shown  in  the  numerical  example  (10)).  This  is  done  by  dropping  the  feasibility  test  for  con¬ 
straints  that  are  nonseparable.  So  the  calculations  of  s,  is  made  only  for  constraints  that  are 
separable.  If  all  constraints  are  nonseparable,  then  feasibility  is  checked  after  integer  points  are 
identified  and  there  is  no  possibility  of  fathoming  at  the  partial  solution  stage.  So  it  is  possible 
to  handle  highly  nonlinear  nonseparable  constraints.  Hence  the  emphasis  in  the  development 
of  this  algorithm  has  been  to  utilize  the  advantages  of  a  dynamic  programming  approach  while 
avoiding,  as  much  as  possible,  the  "curse  of  dimensionality"  which  has  doomed  such  efforts  in 
the  past. 

We  now  present  a  stepwise  description  of  the  algorithm. 

MODIFIED  HYPERSURFACE  SEARCH  ALGORITHM 

1.  Determine  upper  bounds  Uj  for  each  variable.  We  then  have: 

0  <  Xj  <  Uj  for  j  =  1,2 . n 

Xj  €  Ip 

2.  Compute  z0  if  one  is  not  known  a  priori  by 

Zo=  £  fj  ( Uj )  =  2k 
j- 1 

3.  Find  all  combinations  of  x;,  j  =  1,2 . n  which  satisfy: 

Z  //•*))  “  zk  k  =  0, 1,  .  .  . 

7-1 

0  <  x,  <  Uj  j  -  1,2,  ...  ,  n 

Xj  6  ln  j  =  1,2,  ....  n. 

4.  Test  any  integer  point  found  on  the  hypersurface  of  step  3  at  the  partial  solution  stage 
(for  separable  constraints)  for  feasibility  with  the  original  constraints  (2) -(3).  If  a  feasible 
point  is  found,  then  the  value  of  zk  is  tested  against  the  greatest  feasible  lower  bound  on 
z.  If  it  is  less  than  this  bound,  then  the  point  corresponding  to  that  bound  is  optimal. 
Otherwise  go  to  step  5. 

5.  Calculate  a  new  zk+\  according  to  logic  and  notation  developed  in  succeeding  sections. 

6.  Return  to  step  3. 

The  process  of  finding  lattice  points  in  step  3  is  achieved  by  using  a  dynamic  programming 
formulation  of  the  following  problem:  Find  all  Xj,  7=1,  ....  n  such  that: 

Z  fj  (Xj)  =  Zk 

7-1 

(6)  0  <  Xj  <  Uj  j  =  1,  ...  ,  n 

Xj  €  Ip  j  -  1,  ...  ,  n 

In  [1]  and  [2]  an  equivalent  formulation  of  (5)  is  given: 

Max  z  =  Z  //(*/) 

/-i 


* 


— 
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such  that 

n 

£  fj  ( Xj )  <  zk 

J-l 

(7) 

0  <  Xj  <  Uj  j  -  1,  . 

. .  ,  n 

x,Z  l„j-  1, 

. .  ,  n. 

F°r  every  value  of  zk  this  gives  us  the  values  xJt  j  -  1 . n  of  all  lattice  points  x  on  the 

objective  function  hypersurface.  Therefore,  this  problem  can  be  used  to  identify  integer  points 
x  at  any  value  zk  of  the  objective  function.  The  optimal  return  functions  are  given  by  the  fol¬ 
lowing: 


g](A)  =  Max  f  j (x ,)  = 

1  x, 

|-°o, 

x  =/,  (8,),  8,  =0,1,  .. 

otherwise 

•  ,  Ml 

g,(A)  =  Max  l/s(x,)  +  gs_, 

0<v,-8s 

,  s  =  2,3,  .. 

i  (A  -/v(x,)]  x  =0  , 

.  .  n 

■  .  A, 

where  8j  =  min  (us,  [^J) 

fs  (f ,)  =  X 


As  =  £  (Uj) 
j-l 

and  the  notation  [b]  indicates  the  largest  integer  <  (b). 

Equations  (7)  and  (8)  give  the  dynamic  programming  recursion  formulas  for  calculating  integer 
points  in  step  3. 

In  steP  5  a  new  objective  function  value  zk  is  calculated.  It  is  not  necessary  to  consider 
every  integer  value  between  an  upper  bound  on  the  problem  z0  and  the  optimal  objective  func¬ 
tion  value.  In  the  case  in  which  the  separable  functions  are  required  to  map  onto  the  range  / 
this  would  have  been  a  feasible  strategy.  However,  for  functions  /,:  /_  —  R.  this  method 
would  skip  any  noninteger  z.  Let  us  consider  a  different  method  of  calculating  the  sequence  z0, 
*'•  V  ’.Zk  in  step  5  We  know  that  only  values  corresponding  to  integer  points  will  be  finite 
entries  in  the  tabulation  of  the  return  function  &(A).  (Values  for  noninteger  x,(A)  are  associ¬ 
ated  with  gsU)  =- oo.)  Therefore,  let 

z*  =  X("-°  +  /„(x„)  x„  =  u„,  u- 1 . 0, 

where  X("  n  are  values  of  A  entered  in  the  g„_i(  )  function  tabulation.  This  formula  for  calcu¬ 
lating  zk  will  give  all  hypersurface  level  sets  which  contain  lattice  points.  The  interesting  fact 
for  the  current  application  is  that  this  calculation  will  give  all  zk  corresponding  to  hypersurfaces 
containing  lattice  points  even  for  noninteger  values  of  A*"-0  and  f„  (xn). 


A  NUMERICAL  EXAMPLE 

Let  us  consider  the  solution  of  the  following  example: 
Maximize 

z  =  6X|J  +  3xj/3  +  lx\n 


u I _ M I  m — 1 — 


such  that 
(10) 
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3x(  +  4x2  +  3x3  <  10 
2*|  +  3x2  +  3x3  <  10 
*ix22,  <  7 

X\,  x2,  x3  >  0,  integer 

then 

f \  (*|)  =  6x,2, 

/2  (x2)  =  3x2l/3 
/3  (x3)  =  2x31/2. 

from  the  constraints  of  (10)  we  see  that: 


0  ^  Xj  <  3  =  U\ 
0  <  x2  <  2  —  «2 
0  <  x3  <  3  =  U3. 


Therefore, 


8  iOO 


X, 

— oo 


6  Si2,  8,  =  0, 1,2,3 
otherwise. 


Let  us  tabulate  the  return  function  for  the  first  variable 


TABLE  1 .  Optimal  Return  and  Policy 
X  g)  (X)  xi»  (X) 

0  0  0 

6  6  1 

24  24  2 

54  54  3 

For  the  second  stage  the  dynamic  programming  recursion  is 

g2(X)  =  max  [3x2l/3  +  (X  -  3x21/3)] 

for82  =  min  (2,  (£21)  and  X  =  3£1/3. 

We  can  thus  compile  the  following  table: 


TABLE  2.  Optima I  Return  and  Policy 


X 

g?(x) 

x2*(X) 

X 

£200 

x2*(X) 

0 

0 

0 

24 

24 

0 

3 

3 

1 

27 

27 

1 

3.78 

3.78 

2 

27.78 

27.78 

2 

6 

6 

0 

54 

54 

0 

9 

9 

1 

57 

57 

1 

9.78 

9.78 

2 

57.78 

57.78 

2 

Now  we  may  choose  the  sequence  of  values  z0,  z\,  r2,  ...  by  using  the  following  rule: 

z*-X("-',  +  /3(x3). 
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where  X',_1  takes  on  all  values  entered  for  the  state  variable  in  Table  2,  and  x3  —  w3,  w3 

-  1 . 0.  It  is  shown  in  reference  12],  if  it  is  not  immediately  apparent  to  the  reader,  that 

lattice  points  may  lie  only  on  hypersurfaces  corresponding  to 

L  Mxj)  -  zk  =  *("~n  +  /„  (*„); 

/-I 

therefore, 

z0  =  X(2)  +  /3(w3)  =  57.78  +  2  •  3 1/2  =  61.24. 

The  corresponding  integer  solution  is  xj  =  3,  xj  =  2,  xf  -  3,  which  is  not  feasible  for  the 
constraints  given  in  (9).  Now  we  choose 

z,  =  X<2>  +  /3  (u3  -  1)  =  57.78  +  2  ■  21/2  =  60.61 

corresponding  to  x*  =  3,  x*  =  2,  x*  =  2.  Again  the  constraints  given  in  (9)  are  violated.  We 
continue  the  steps  of  the  algorithm  until  we  generate  a  feasible  lattice  point: 


z2  =  57.78  +  2  =  59.78;  xj^3,  x*=2,  x3=- 1  (infeasible) 

z3  =  57.78  +  0  =  57.78;  xf=3,  X2=2,  x3=0  (infeasible) 

z 4  =  57  +  3.465  =  60.46;  x*=3,  xj^l,  x*=*3  (infeasible) 

z5  =  57  +  2.828  -  59.828;  xf-3.  *J-1,  x?=2  (infeasible) 

z„  =  57  +  2  «  59;  xf=3,  xj-l,  x*-l  (infeasible) 

z7  =  57  +  0  =  57;  xf-3,  x*=l,  x?=0  (infeasible) 

z8  =  54  +  3.464  =  57.464;  xf-3,  xJ-0,  x^-3  (infeasible) 

zg  =  54  +  2.828  —  56.828;  xf=3,  xJ—0,  xj*2  (infeasible) 

Zio  =  54  +  0  =  56  xf— 3,  xj— 0,  xj—l  (infeasible) 

z3i  =  54  +  0  =  54;  Xv“3,  xjM),  x^O  (feasible). 


Now  we  have  a  bound  which  is  feasible  for  the  integer  problem.  However,  we  must  continue 
the  calculations  until  no  hypersurface  with  a  greater  z  is  unexamined.  If  we  check  zl2,  we  find 
that 

z 1 2  =  27.78  +  3.465  <  54 

and  all  subsequent  calculations  also  have  z  values  less  than  this  bound.  Hence  54  is  the  optimal 
value  of  the  objective  function  and  this  value  corresponds  to  a  global  maximum  at  X]  =  3, 
x2  =  0,  x3  =  0. 

COMPUTATIONAL  RESULTS 


Results  from  51  randomly  generated  problems  are  reported.  The  problems  were  of  the 
following  form. 

Max  z  =  £  />(*/) 

/-i 

(11)  J)  a, i  x,  <6,  /  —  1,2 . m 

where 


(12) 


f)  (•*/)  -  «,  x, ■  +  p,  x,2  +  y,  x> 


and  pn  y t  were  non-negative  integers.  Results  are  given  in  Table  3.  The  number  of  terms 
in  the  objective  function  for  n  =  20  could  be  as  large  as  3 n  =  60.  In  addition,  the  problems 
were  generated  so  that  the  upper  bounds  n;  for  each  x,  determined  from  the  constraints  (15) 
varied  among  the  values  0,  1,  2,  3  so  that  a  maximum  of  4  integer  values  need  be  considered 
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TABLE  3 


M 

O 

1 

N 

# 

m  x  n 

4  x  10 

m  x  n 

4  x  15 

m  x  n 

4  x  20 

m  x  n 

4  x  25 

m  x  n 

4  x  30 

79-38 

.92 

1.35 

32.38 

16.26 

69.86 

101-71 

1.16 

14.83 

45.66 

68.61 

52.28 

129-37 

2.38 

13.19 

16.45 

26.18 

63.77 

102-35 

2.45 

4.84 

25.45 

94.65 

229.74 

75-34 

5.28 

5.39 

47.01 

57.32 

35-16 

1.30 

9.74 

21.06 

27.35 

51-30 

.49 

26.53 

77.69 

35.65 

95-84 

.58 

3.47 

18.69 

62-23 

1.09 

10.20 

22.88 

53-28 

1.99 

3.09 

48.19 

58-29 

.38 

11.14 

68.62 

85-57 

.53 

6.82 

59-36 

.49 

9.27 

169-94 

11.71 

4.51 

61-28 

.76 

7.92 

80-28 

1.00 

18.12 

125-34 

4.52 

7.64 

53-25 

.32 

12.55 

77-23 

1.16 

17.07 

64-30 

.84 

23.81 

Total  (sec) 

39.35 

211.48 

424.08 

326.02 

415.65 

Avg.  (sec) 

1.97 

10.57 

38.55 

37.26 

103  91 

for  each  variable.  All  computations  were  carried  out  on  a  CDC  Cyber  70,  Model  72,  a  medium 
speed  computer.  An  important  feature  of  the  program  when  the  constraints  are  separable  is 
that  partial  solutions  may  be  discarded  when  they  become  infeasible  by  calculating  the  slack  in 
each  constraint  as  soon  as  Xj  is  determined.  This  procedure  is  not  carried  out  in  the  detailed 
example  of  the  previous  section  because  of  the  presence  of  a  single  nonseparable  constraint. 
Even  in  this  case,  when  some  constraints  are  separable,  partial  solutions  may  still  be  discarded 
using  only  the  separable  constraints. 


CONCLUSION 

This  paper  presents  an  exact  solution  method  for  an  extended  class  of  problems  with 

n 

highly  nonlinear  objective  functions,  £  fj  (x;),  that  have  the  property  that  /,:  IP  —  RP. 

i-j 

Hence  problems  with  an  extended  class  of  objective  functions— namely  those  that  have  rational 
values— may  be  solved  exactly  for  the  global  optimum.  The  method  is  one  example  of  an 
effort  to  utilize  the  favorable  features  of  dynamic  programming  while  avoiding  the  storage  and 
computational  difficulties  associated  with  multiply-constrained  problems. 
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abstract 

This  paper  is  concerned  with  models  of  the  use  of  performance  incentives 
in  the  Soviet  Union  and  United  States.  The  principal  analytical  result  is  an  ex¬ 
tension  of  an  analysis  of  the  methods  whereby  Soviet  planners  make  the  deci¬ 
sion  about  production  targets  a  variable  under  control  of  the  producer,  who  is 
the  only  one  possessing  a  knowledge  of  the  uncertain  condition  of  production. 

It  is  shown  that  this  device  can  be  viewed  as  a  classical  inventory  problem. 

There  is  also  an  examination  of  the  "U.S.  incentive  program”  referring  to 
multi-incentive  contracts  in  which  the  profits  received  by  the  private  producers 
are  related  to  performance,  outcome  and  cost.  The  analysis  describes  how  this 
device  can  be  extended  to  solve  the  target  output  selection  problem  of  the  So¬ 
viet  planning  system. 


INTRODUCTION 

The  central  planning  organizations  of  both  the  Soviet  Union  and  the  United  States  have 
been  concerned  with  influencing  the  behavior  of  enterprises  in  order  to  achieve  improved  allo¬ 
cation  of  resources.  Although  one  is  more  likely  to  associate  the  central  planning  task  with  an 
economy  such  as  the  Soviet  Union,  the  provision  of  many  goods  is  centrally  planned  in  the 
United  States.  For  example,  the  Planning-Programming-Budgeting  System  of  the  United  States 
government  can  be  viewed  as  part  of  a  central  planning  process.  In  the  analysis  I  compare  the 
incentive  systems  of  the  Soviet  Union  and  the  United  States,  thereby  clarifying  the  similarities 
that  exist  between  the  two  economic  systems.  A  suggestion  for  improvement  to  the  U.S. 
incentive  system  is  also  made. 

The  key  similarity  between  planning  in  the  Soviet  Union  and  the  United  States  is  that  the 
government  does  not  know  as  much  about  individual  values  and  technological  opportunities  as 
do  the  relevant  producers  or  consumers.  For  example,  in  the  United  States,  the  Planning- 
Programming-Budgeting  System  frequently  culminates  in  government  acquisition  of  goods  such 
as  military  hardware  and  space  systems  from  private  enterprises.  The  production  technolgies 
associated  with  many  of  these  goods  are  not  only  highly  uncertain,  but  are  also  more  accurately 
known  by  the  producers  than  by  the  government.  Such  goods  as  intercontinental  ballistic  mis¬ 
siles  and  manned  space  vehicles  have  embodied  in  them  advanced  technology  and  associated 
uncertainty  about  the  conditions  of  production.  This  uncertainty  is  probably  most  pervasive 
during  the  engineering  development  phase  of  the  "production"  process  when  the  performance 
characteristics  of  these  goods  are  determined.  The  producers  of  these  goods,  however,  will 
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typically  have  a  greater  knowledge  of  this  advanced  technology  and  its  impact  on  the  conditions 
of  production  than  does  the  government.  Thus,  the  conditions  of  production  are  more  uncer¬ 
tain  for  the  government  than  for  the  producer  at  that  time.  The  uneven  impact  of  uncertainty 
implies  that  it  is  not  possible  for  the  government  to  specify  the  best  output  level. 

A  similar  situation  exists  in  the  Soviet  Union.  The  state  enterprise  may  have  better 
knowledge  of  its  production  technology  than  the  planners,  and  the  Soviet  planners,  therefore, 
may  be  unable  to  specify  the  optimal  output  level.  Yet  both  the  U.S.  and  Soviet  decision  mak¬ 
ers  attempt  to  provide  appropriate  incentives  to  motivate  producers  to  select  the  "right"  output 
level. 


In  the  United  States  this  system  employs  the  so-called  "contractual  incentive  function" 
which  specifies  a  mutually  acceptable  rule  connecting  the  monetary  rewards  of  one  decision 
maker  to  the  subsequent  performance  of  another.  Numerous  enterprises  have  devised  profit- 
sharing  formulae  to  motivate  supervisory  and  managerial  personnel,  and  the  Department  of 
Defense  and  NASA  have  relied  on  the  use  of  performance  incentives  to  monitor  the  work  of 
major  contractors.  For  example,  performance  incentives  were  included  in  contracts  with  a  total 
value  of  several  billion  dollars  during  the  U.S.  moon  program.*  Recent  innovations  in  the  use 
of  performance  incentives  have  appeared  in  the  new  Amtrak  contract  which  provides  payments 
to  the  railroads  according  to  the  quality  of  services  they  provide  [2].  A  similar  contractual 
arrangement  guarantees  a  one  percent  increase  in  the  salaries  of  the  policemen  of  Orange,  Cali¬ 
fornia  for  every  three  percent  decline  in  rape,  robbery,  burglary  and  auto  theft  [151. 

Although  the  existing  literature  on  economic  planning  does  not  specifically  mention  the 
use  of  contractual  incentive  functions,  a  related  concept  has  arisen  in  discussions  of  "success 
indicators"  in  Soviet  planning.  The  Soviet  planning  system  often  rewards  enterprise  agents 
according  to  the  degree  to  which  producers  reach  certain  planned  targets.t  Thus,  Soviet 
planners  have  implicitly  defined  a  performance  incentive  system.  In  contrast  to  similar  systems 
employed  in  the  West,  the  Soviet  system  has  not  been  "contractual"  in  the  sense  that  it  has 
been  agreed  upon  by  the  planners  and  the  enterprise  managers.  Instead,  the  state  has  unila¬ 
terally  chosen  the  targets  and  rewards,  and  the  enterprise  managers  have  been  expected  to  com¬ 
ply  in  order  to  attain  their  own  maximum  reward  within  the  confines  of  the  rules  laid  down  by 
the  planners.  This  "non-contractual"  incentive  system  has  clearly  been  an  example  of  the  use  of 
performance  incentives  in  the  implementation  of  economic  planning. 

Recently,  the  Soviets  have  experimented  with  an  extended  incentive  system  which  pro¬ 
vides  motivation  for  the  state  enterprises  to  select  the  optimal  target  output  level  before  the 
determination  of  the  actual  output.  The  importance  of  this  additional  incentive  stems  from  the 
fact  that  if  the  central  planners  have  a  good  estimate  of  the  amount  of  the  good  which  will  be 


’Using  v,  and  v  to  represent  measures  of  relative  profit,  cost,  and  performance,  the  functional  form  which  applied  to 
several  of  the  large  dollar  value  incentive  contracts  used  during  the  U.S.  moon  program  is 

c  -  /'( v)  +  v(r)  +  iti'(x)x(> )  + 

where 

H\)  *  u j yc  .v  \c(v)  —  h | vc  1  v  3 

and  «|.  «2.  «j,  />|.  hj,  hj,  <r.  and  /J  are  constants.  In  addition,  between  1967  and  1970,  there  were  approximately  $2 7 
billion  of  multiple  incentive  contracts  evaluated  by  a  Department  of  Defense  analysis  group 

til  is  widely  accepted  that  this  is  the  Soviets'  most  famous  planning  problem.  In  addition  to  monetary  incentives,  the 
Soviets  have  also  tried  to  solve  this  problem  using  informational  exchange  during  bargaining  with  the  enterprise. 
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produced  before  it  is  actually  produced,  then  a  better  coordinated  plan  can  be  achieved.  There 
is  also  a  need  for  planners  to  coordinate  outputs  that  are  jointly  used.  For  example,  some  inter¬ 
mediate  goods  are  used  jointly  in  the  production  of  final  goods,  and  final  goods  may  be  jointly 
consumed.  The  reason  why  an  incentive  is  required  to  motivate  the  managers  of  the  state 
enterprises  to  reveal  the  optimal  target  is  that  the  enterprise  may  also  receive  a  reward  based  on 
the  actual  output  achieved  in  relation  to  the  target  output  level.  The  existence  of  this  reward 
may  motivate  the  managers  to  understate  the  target  output  level  if  they  are  simply  asked  its 
value. 

Martin  Weitzman  [16]  has  analyzed  this  new  incentive  system  using  a  model  whose  pro¬ 
perties  have  been  investigated  by  Fan  [6]  and  Bonin  [3],  In  this  model,  the  planners  fix  the 
resources  or  inputs  available  to  the  enterprise,  but  there  is  uncertainty  associated  with  the  out¬ 
put  that  can  be  produced  with  these  fixed  inputs.  The  uncertainty  rests  with  the  planners,  thus 
justifying  the  selection  of  the  target  output  level  by  the  enterprise.  Although  the  output  actu¬ 
ally  achieved  is  not  selected  by  the  enterprise,  Weitzman  shows  how  the  enterprise  can  use  its 
knowledge  of  the  uncertain  conditions  of  production  in  conjunction  with  a  specified  perfor¬ 
mance  incentive  to  select  the  best  target  output. 

This  report  will  first  review  the  Weitzman  analysis  and  then  show  that  the  new  Soviet 
incentive  program  can  be  viewed  as  a  classical  inventory  problem,  which  is  a  problem  of  deter¬ 
mining  how  much  of  product  to  keep  in  storage.  This  interpretation  of  the  incentive  program  is 
important  because  inventory  theory  is  a  well-developed  analytical  framework  and  general  associ¬ 
ations  between  inventory  theory  and  planning  may  prove  fruitful. 

Recently,  the  state  enterprises  in  the  Soviet  Union  have  been  given  greater  flexibility  in 
their  use  of  inputs.  In  view  of  this  change,  1  next  show  how  the  new  Soviet  incentive  program 
can  be  extended  to  deal  with  a  situation  in  which  the  inputs  used  by  the  enterprise  are  choice 
variables  with  associated  cost.  This  extension  has  some  similarities  to  the  analyses  of  Bonin 
and  Marcus  [4],  Snowberger  [14],  and  Miller  and  Thornton  [11]  who  have  investigated  the 
implications  of  a  relationship  between  output  and  variations  in  the  producer’s  effort. 

An  analysis  of  the  U.S.  incentive  program  will  follow  the  discussion  of  the  Soviet  incen¬ 
tive  system.  To  ease  comparison  between  the  two  systems,  a  cost-effectiveness  model  will  be 
used.  Thus,  I  assume  that  the  objective  of  the  goverment  is  the  achievement  of  some  specified 
level  of  performance  at  minimum  cost.  The  large  degree  of  uncertainty  that  exists  during 
engineering  development  prevents  the  government  from  knowing  in  advance  what  performance 
level  will  be  achieved  for  any  level  of  expenditure.  Furthermore,  both  the  estimate  of  the 
actual  performance  level,  the  target,  and  the  performance  level  actually  achieved  have  associ¬ 
ated  costs  that  must  be  borne  by  the  government  rather  than  the  producer.  Therefore,  the  per¬ 
formance  incentive  can  be  viewed  as  a  method  of  motivating  the  producer  to  take  appropriate 
account  of  these  costs  during  engineering  development.  This  internalization  of  social  costs  by 
the  producer  suggests  a  relationship  not  only  to  the  control  of  environmental  externalities,  but 
also  to  the  design  of  a  transfer  price  by  one  division  of  an  enterprise  that  is  providing  some 
intermediate  good  to  a  second  division.  The  transfer  pricing  problem  has  been  recently 
analyzed  by  Ronen  [13],  and  Groves  and  Loeb  [7], 

The  existing  incentive  system  motivates  the  producer  to  economize  on  the  costs  associ¬ 
ated  with  the  output  level  actually  achieved.  This  system  can  be  expanded  to  solve  the  target 
output  selection  problem.  This  expansion  would  lead  to  a  greater  compatibility  of  the  various 
interrelated  output  decisions,  thus  making  the  target  output  itself  a  product  worth  paying  for  in 
the  U.S. 
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I.  THE  NEW  SOVIET  INCENTIVE  MODEL 

In  analyzing  the  new  Soviet  incentive  system,  Weitzman  uses  a  model  in  which  the  fac¬ 
tors  of  production  used  by  the  enterprise  are  set  by  the  planners,  an  assumption  which  is  realis¬ 
tic  in  the  U.S.S.R.  where  inputs  have  typically  been  rationed  by  the  state.  A  tentative  target  y 
and  a  tentative  bonus  fund  B  are  assigned  to  the  enterprise  during  the  first  or  preliminary  phase. 
The  tentative  target  is  the  planner’s  best  estimate  of  the  target  output  level  at  that  time.  Dur¬ 
ing  the  second,  or  planning  phase,  the  enterprise  has  the  option  of  revising  the  tentative  target 
to  y  which  has  associated  with  it  a  revised  bonus  fund  B  computed  in  accordance  with  the  for¬ 
mula,* 

B  -  B  +  0(y  -  y), 

where  the  constant  /3  is  proportional  to  the  "real  social  value  of  having  an  extra  unit  which  has 
been  pre-planned"  [16]. 


In  the  third  or  implementation  phase,  when  the  enterprise  ends  up  producing  amount  y ,  it 
actually  receives  the  bonus  fund 


B  = 


B  +  a(y-y):y^y  (overfulfillment) 
6  -  y(y  -  y)  :  y  <  y  (underfulfillment) 


where  a  is  proportional  to  the  "real  social  value  of  having  an  extra  unit  unexpectedly 
delivered,"  and  y  is  proportional  to  the  "real  social  cost  of  being  unexpectedly  caught  short  by 
one  unit”  [16].  Subsequent  analysis  reveals  that  correct  decision  making  by  the  enterprise 
requires  that  the  constant  /3,  a,  and  y  be  in  the  same  proportion  to  their  respective  value 
coefficients.  Under  the  "old"  Soviet  incentive  system,  B  and  y  were  fixed  by  the  planners. 
Under  the  new  system,  they  are  set  by  the  enterprise. 


In  the  model  developed  by  Weitzman,  there  is  uncertainty  during  the  planning  phase  as  to 
the  amount  of  output  that  will  actually  be  produced  with  the  fixed  inputs.  Only  the  producer 
knows  the  probability  density  function  /(y).  Thus,  we  have  an  example  of  the  informational 
asymmetry  which  is  so  prevalent  during  the  planning  process  and  a  justification  for  the  producer 
to  select  the  target  output  level.  This  uncertainty  might  in  fact  persist  during  the  implementa¬ 
tion  phase,  Kut  with  fixed  inputs,  actual  output  y  is  not  a  choice  variable,  and  thus,  the  charac¬ 
ter  of  the  uncertainty  that  applies  then  is  not  relevant  to  this  analysis. 

During  the  planning  phase,  when  y  is  selected,  the  problem  faced  by  the  enterprise 
(assumed  risk  neutral)  is  to  choose  y  to  maximize 

X.OO  t B  +  p(y  -  y)  +  y(y  -  y)]  f(y)dy 
+  J.  IB  + /3(y  -  y)  +  a(y  -  y)]  f(y)dy. 

By  differentiating  with  respect  to  y,  Weitzman  shows  that  the  optimal  solution  to  this  maximiza¬ 
tion  problem  is  to  select  y  such  that 

(2)  P(y  >  y)  = 

y  —  a 

where 


•If  the  selection  of  the  target  ^influences  the  allocation  of  inputs  to  the  enterprise,  then  the  producer  will  take  this  into 
account  in  the  selection  of  y.  This  raises  a  serious  incentive  compatability  problem  which  has  been  discussed  by  Magat 
and  Loeb  (101.  We  assume  that  such  a  relationship  does  not  exist. 
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P(y  >  y)  -  /.  f(y)dy. 

Because  it  is  possible  to  multiply  all  of  the  coefficients  by  a  constant  without  changing  (2), 
only  the  relative  magnitudes  of  the  coefficients  matter  in  determining  the  optimal  y.  The 
appropriate  relative  magnitudes  are  achieved  when  these  coefficients  are  in  the  same  proportion 
to  their  respective  value  coefficients.*  Furthermore,  in  view  of  the  fact  that  (2)  must  be  posi¬ 
tive,  this  incentive  system  is  meaningful  only  when  the  coefficients  are  set  such  that 
a  <  B  <  y. 

An  Inventory  Theory  Interpretation 

The  fact  that  there  are  costs  borne  by  the  center  when  the  actual  outcome  is  both  below 
and  above  target  suggests  that  an  inventory  theoretic  interpretation  can  be  given  to  Weitzman’s 
analysis.  To  see  the  classical  inventory  structure  of  this  problem,  rewrite  (1)  as 

(3)  B  +  p(y  -  y)  +  J_ooy0'  -  y)f(y)dy  +  J.  a(y  -  y)f(y)dy. 

The  difference,  B  -  By,  is  fixed  and  therefore  not  relevant  when  choosing  the  target  output 
level,  but  we  must  concern  ourselves  with  the  term  By  which  can  be  written  as 

By  =  B  r  yfWdy  +  B  V  (y  -  y)f(y)dy  -  B  f.  (v  -  y)f(y)dy- 

The  expression, 

(4)  ~bJ.  O'  -  y)f(y)dy, 

is  proportional  to  the  benefits  foregone ,  weighted  by  the  probabilities,  as  a  result  of  the  economic 
system  not  being  geared  to  a  higher  y  when  the  actual  output  is  larger  than  the  target. 

If  the  target  is  not  achieved,  the  actual  y  is  less  than  y,  and 

(5)  B  f  O'  -  y)  f(y)dy 

is  applicable.  This  expression  can  be  viewed  as  (proportional  to)  the  benefits  still  received 
(weighted  by  probabilities)  from  having  the  system  geared  to  y.  Now  insert  (4)  and  (5)  into 
the  last  two  parts  of  (3),  and  obtain  as  the  producer’s  problem  the  maximization  of 

( B  -  y)Jj^(y  -  y)f(y)dy  +  (a  -  /3)J*.  (y  -  y)f(y)dy 

which  is  equivalent  to  the  minimization  of 

(y-/3)  r  (y  -  y)f(y)dy  +  (fi  ~  a)  f.  (y  -  y)f(y)dy. 

J  — OO  y 

The  coefficient  y  -  B  can  now  be  identified  as  (proportional  to)  the  net  social  cost  per 
unit  of  output  actually  achieved  below  the  target  and  j8-a  as  (proportional  to)  the  net  social 
cost  per  unit  of  output  above  the  target. 


‘Each  coefficient  of  the  right  hand  side  of  (2)  can  be  multiplied  by  a  constant  k  yielding 

ky  -  kp  _  k(y  -  /8)  _  y  -  fi 
ky  -  ka  Arty  -  a)  y  -  a 

In  that  the  value  coefficients  are  measured  in  rubles  per  extra  output,  multiplying  each  coefficient  by  a  constant  can  be 
viewed  as  a  change  in  the  monetary  unity  which  could  never  affect  the  selection  of  .v.  Also,  note  that  the  units  associat¬ 
ed  with  each  coefficient  of  the  right  hand  side  of  (2)  cancel.  As  the  left  hand  side  of  (2)  is  a  probability  (a  pure 
number),  such  a  cancellation  is  required  to  equate  both  sides  of  (2). 
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Let  us  use  the  notational  convention 
ci-y-fi. 

c2  =  P~  a. 

The  problem  facing  the  enterprise  can  therefore  be  written 
(6)  Min  C(  fy(y  —  y)f(y)dy  +  c2  f  O'  -  y)f{y)dy. 

■  J- oo  J  y 


When  the  producer’s  maximization  problem  (1)  is  rewritten  as  the  minimization  problem 
(6) ,  it  is  possible  to  view  the  selection  of  y  as  the  selection  of  the  amount  of  a  good  (the  target) 
to  be  placed  in  inventory.  The  coefficient  Cj  can  be  viewed  as  the  carrying  cost  per  unit  of 
unsold  inventory  and  c2  the  per  unit  shortage  cost.  Taking  the  derivative  of  (6)  with  respect  to 
y  (and  equating  it  to  zero)  we  see  that  for  the  optimal  solution  value  y. 


(7) 


P{y  <  y)  - 


c2  _  P  —  a 
C\  +  c2  y-a' 


Equation  (7)  is  a  well-known  formula  from  inventory  theory  (5].  Therefore, 


P(y  >  y)  - 


c,  +  c2 


which  is  the  solution  obtained  by  Weitzman. 


y-p 

y  —  a  ' 


The  inventory  formula  (7)  has  a  simple  economic  interpretation.  Letting  P  =  P(y  <  y), 
this  formula  can  be  rewritten  as 

(8)  Pc{  =  (1  -  P)  c2 

and  indicates  that  P  should  be  selected  through  the  selection  of  y,  so  that  the  expected  net 
social  cost  of  the  output  produced  less  than  the  target  and  the  output  produced  at  least  as  great 
as  the  target  are  equal.  The  reason  an  inventory  theoretical  interpretation  is  interesting  is  that 
inventory  theory  is  a  well-developed  framework  and  analogies  that  can  be  found  with  the  plan¬ 
ning  process  might  prove  fruitful  in  the  development  of  a  theory  of  economic  planning. 

Production  Inputs  Variable 

Although  Weitzman  has  chosen  to  view  production  inputs  as  fixed,  largely  because  this 
assumption  reflects  the  Soviet  planning  environment,  it  is  possible  to  extend  his  analysis  by 
allowing  the  production  inputs  used  during  the  implementation  phase  to  be  choice  variables  of 
the  enterprise.  This  extension  may  have  relevance  to  the  Soviet  planning  problem  now  that  the 
managers  of  state  enterprises  are  being  given  greater  flexibility  in  the  use  of  inputs.  In  order  to 
simplify  the  analysis,  I  assume  that  there  is  the  same  degree  of  uncertainty  about  the  conditions 
of  production  during  the  planning  phase  when  the  enterprise  selects  the  target  output  level  and 
during  the  implementation  phase  when  a  level  of  cost  expenditure  is  selected.  In  the  view  of 
the  enterprise,  the  conditions  of  production  during  both  phases  can  be  represented  by 

y  -  h(c,  0), 

where  9  is  a  random  variable  with  density  function  f(9)  applicable  for  both  the  planning  phase 
and  the  implementation  phase,  and  c  represents  production  costs.  Although  identical  uncer¬ 
tainty  permits  one  to  view  the  enterprise  as  selecting  y  and  c  simultaneously,  a  meaningful 
economic  interpretation  can  still  be  given  for  the  need  to  select  y  beforehand  by  assuming  that 
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the  actual  output  y  is  not  revealed  at  the  time  c  is  selected  but  rather  at  some  later  time  which 
can  be  called  the  implementation  phase.  It  will  also  be  assumed  in  this  extension  that  the  share 
of  cost  expenditure  borne  by  the  enterprise  is  equal  to  s. 


Assuming  that  the  coefficients  fi,  y,  and  a  continue  to  apply,  and  the  profits  are  deter¬ 
mined  by  T(y,c),  the  producer  must  solve 

(9)  Max  T(y,c)  =  f ‘r'°',c>  [fi  +  p(y  -  y)  +  y(h(c,0)  -  j»  -  sc)f(9)d0 

y.c  J  ““ 

+  S  I,-  ,  ^  +  /3( V  -  y )  +  a(h(c,9)  -  y)  -  sc]f(0)d0 

^9  '(j-.c) 

where  the  inverse  function  0~i(y.c)  determines  the  value  of  0  which  achieves  y  -  y  when  the 
production  costs  are  c.  The  enterprise  must  set  the  derivative  of  this  with  respect  to  y  equal  to 
zero  obtaining 

0W  *  03  -  y)f(9)d0  +  03  ~  a)f(0)d0  «  0. 

It  is  easy  to  verify  that  this  equality  implies  that 

(10)  P(0  >  <r'(P,c))=  ilzJL. 

y  —  a 

Similarly,  the  derivative  of  (9)  with  respect  to  c  set  equal  to  zero  yields* 

■§7  “  /I  ^C)  l yhe(c,0)  -  s]f{9)d0 
(H)  +  f00  lahc(c,9)  -  s]f(9)d9  -  0. 

To  obtain  qualitative  results  we  require  knowledge  of  the  function  h(c,0).  Assume  that  the 
uncertainty  is  additive  and  that  hc  depends  only  on  c(hc0  =  0).t  Then  it  can  be  shown  that  (11) 
implies  that 


(12) 


P(o  >  e-'CP.c)) 


yK  -  S 
(y  -  a)hc  ' 


For  both  (10)  and  (12)  to  be  satisfied  simultaneously,  it  must  be  true  that 


y  —  _  yhc  -  s 

y  —  a  (y  —  a)hc 


which  implies  that  the  producer  must  set 

(13)  fihc(c)  =  s. 

This  condition  implies  that  when  y  is  optimal,  the  selection  of  the  level  of  cost  by  the  enterprise 
can  be  determined  by  evaluating  the  profit  from  a  small  adjustment  in  y.  The  effect  on  the 
profit  obtained  from  y  captured  when  the  optimal  value  of  y  is  selected.  Thus,  the  producer 
should  vary  c  until  the  extra  profit  associated  with  a  small  increase  in  target  output  (fihc)  just 
equals  the  reduced  profits  from  increasing  c  by  one  unit  (s). 


One  can  rewrite  (13)  as 


c  -  hc  '(s/p). 


'Notation  such  as  h,  represents  the  partial  derivative  of  the  function  h  with  respect  to  the  variable  r. 

+ Although  this  assumption  is  strong,  it  is  frequently  interesting  to  know  what  assumptions  are  required  to  obtain  a 
sharp  characterization  of  an  optimal  policy.  It  is  not  difficult,  however,  to  imagine  an  interaction  between  the  level  of 
cost  expenditure  and  the  random  variable  9.  For  example,  high  levels  of  cost  expenditure  might  be  associated  with 
greater  uncertainty.  Such  interactions  have  been  excluded  from  the  analysis. 
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thereby  permitting  (10)  to  be  written  as 

(,4)  P(9  >  9-l(y,hTl(s/l 3))  - 

y  —  a 

The  enterprise  must  satisfy  this  condition  during  the  planning  phase  when  selecting  y.  This 
condition  recognizes  that  during  the  implementation  phase  the  producer  selects  the  optimal  cost 
expenditure.  Comparing  (14)  with  (2)  shows  that  the  producer  must  simply  account  for  the 
impact  of  the  additional  choice  variable  (cost)  on  the  likelihood  of  being  over  target.  However, 
once  this  adjustment  is  made,  the  economic  interpretation  described  by  (8)  continues  to  apply. 

II.  THE  U.S.  INCENTIVE  MODEL 

The  purpose  of  the  U.S.  incentive  model  is  to  motivate  producers  to  select  a  performance 
level  that  is  socially  optimal.  The  DOD  and  NASA  Guide  states  that 

the  concept  of  multiple  incentive  contracting  must  quantitatively  relate  profit  motivation 
directly  and  in  accordance  with  the  Government’s  objectives.  ...  it  establishes  the 
contractor’s  profit  in  direct  relationship  to  the  value  of  the  combined  level  of  performance 
in  all  areas  [5]. 

Furthermore, 

the  process  of  including  performance  in  an  incentive  structure  must  logically  begin  with 
the  determination  of  the  "value"  of  the  characteristics  which  will  be  incentivized.  The  mul¬ 
tiple  incentive  contract  should  reflect  the  importance  to  the  government  of  various  cost, 
schedule,  and  performance  outcomes,  through  the  profits  assigned  to  each  part  of  the  mul¬ 
tiple  incentive  structure  [5], 

Cost-Effectiveness  Analysis 

One  method  of  describing  the  U.S.  incentive  model  is  to  use  a  cost-effectiveness  analysis 
approach.*  This  approach  applies  when  the  government’s  objective  is  the  achievement  of  some 
specified  level  of  system  performance  at  minimum  cost  and  it  simplifies  comparison  of  the  U.S. 
incentive  model  with  the  Soviet  model.  It  is  assumed  that  increasing  performance  level  p  of 
some  component  of  the  system  during  the  engineering  developing  phase  of  procurement  leads 
to  future,  or  "downstream"  cost  savings  for  the  government  because  of  reduced  acquisition  cost, 
maintenance  costs,  etc.  The  basic  structure  of  the  U.S.  incentive  model  can  be  most  easily 
illustrated  if  it  is  assumed  that  the  producer  is  given  a  performance  reward  based  on  the  level  of 
p  actually  achieved  and  on  development  costs.  Later,  a  more  complicated  model  will  show  how 
the  U.S.  incentive  program  can  be  expanded  to  incorporate  the  target  selection  features  of  the 
new  Soviet  incentive  program. 

I  assume  that  the  cost  of  development  function,  C(p),  is  deterministic  during  the  imple¬ 
mentation  phase  when  the  producer  actually  selects  p.  This  function  may,  however,  be  known 
only  to  the  producer.  Indeed,  in  order  to  justify  using  a  performance  incentive  in  the  first 
place,  there  must  be  some  uncertainty  in  the  government’s  mind  about  the  cost  of  development 
at  the  time  the  incentive  is  specified.  Otherwise,  the  government  would  simply  specify  p.  The 


*As  far  as  I  am  aware,  the  first  mathematical  treatment  of  multiple  incentive  contracting  using  a  cost-effectiveness  ap¬ 
proach  similar  to  the  one  presented  here  was  by  Ackerman  and  Krutz  111. 
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downstream  cost  function.  Dip),  determines  the  costs  borne  by  the  government  through  the 
dependence  on  the  output  level  selected  by  the  producer.  This  function  is  assumed  to  be  known 
by  the  government.  Total  cost  is  the  sum  of  the  development  cost  and  the  downstream  cost 
and  is  designated  Tip).  At  the  time  of  producer  decision  making,  the  objective  of  the  govern¬ 
ment  is  to  solve 

Min  Tip)  «  Cip)  +  Dip). 

t> 

The  first  order  condition  for  this  problem  is 

(15)  C  =  —  D’, 

which  simply  says  that  the  performance  level  should  be  increased  until  the  producer’s  marginal 
development  cost  expenditure  just  equals  the  government’s  marginal  downstream  cost  reduc¬ 
tion. 

The  profit,  or  performance  incentive  function  given  to  the  producer  under  the  U.S.  incen¬ 
tive  system  is  typically  of  the  form 

(16)  it  =  Gip)  -  sC 

where  Gip)  represents  dollars  of  profit  earned  as  a  function  of  the  performance  level  p,  and  s 
equals  the  share  of  the  development  cost  borne  by  the  producer.  The  relevant  first  order,  or 
profit  maximizing  condition  for  the  producer  is 

G'ip)  -  sC'ip). 

In  view  of  the  government’s  optimization  condition  (15),  the  optimal  incentive  structure  is 
obtained  when  —D'  is  substituted  for  C\  and  the  government  constructs  the  performance 
incentive  function  such  that 

G'ip)  =  -  sD  ip). 

Therefore,  with  the  inclusion  of  a  constant  A,  a  performance  incentive  function  of  the  form 

it  =  —  sD  ip)  —  sC  +  A 

will  motivate  the  producer  to  satisfy  (15),  thereby  satisfying  the  objectives  of  the  government.* 

Extending  the  U.S.  Incentive  Model 

For  .selected  U.S.  procurements  in  which  the  producer  is  the  only  supplier  of  a  good 
whose  performance  is  rewarded  in  relation  to  some  target  (thereby  creating  an  incentive  for  the 
producer  to  understate  the  target  if  simply  asked  its  level),  there  is  value  in  extending  the  U.S. 
incentive  program  to  include  producer  target  specification.  In  addition  to  depending  on  the 
actual  performance  level,  downstream  costs  also  depend  on  the  target  performance  level 
because  of  the  time  needed  to  prepare  the  operational  environment  (e.g.,  train  maintenance 
people,  etc.)  for  the  actual  performance  level. 

In  extending  the  U.S.  incentive  model,  I  assume,  for  the  purpose  of  comparison  with  the 
Soviet  incentive  model,  that  during  the  planning  phase  an  incentive  function  is  specified  and 
the  producer  selects  a  target  performance  level  p.  The  actual  performance  level  p  is  not 
achieved  until  an  implementation  phase. 


*N.  J.  Ireland  (91  has  shown  that  the  "ideal  price”  is  the  benefit  function  of  the  central  planning  organization  for  certain 
types  of  planning  environments.  The  function  Dip)  can  be  interpreted  as  the  benefit  function  of  the  planners  for  the 
problem  at  hand.  Note,  however,  that  in  this  analysis,  the  cost  sharing  ratio  is  also  applied  to  the  function  Dip). 
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During  the  planning  phase,  the  downstream  cost  function  will  be  of  the  form  D(p,p). 
Although  all  costs  are  variable  at  that  time,  certain  downstream  costs  are  fixed  at  the  time  the 
actual  performance  is  achieved.  During  the  planning  phase  the  dependence  of  these  costs  on 
the  target  performance  level  can  be  represented  by  F(p).  Those  costs  which  remain  variable 
when  the  actual  performance  level  is  determined  can  be  represented  by  D\ ( p,p ). 

In  order  to  parallel  the  extension  of  the  new  Soviet  incentive  model  to  the  situation  where 
inputs  are  variable,  I  now  assume  that  the  choice  variable  of  the  producer  is  a  level  of  develop¬ 
ment  cost  expenditure  c.  In  the  cost-effectiveness  analysis  section  above,  the  performance  level 
p  was  selected  as  the  producer’s  choice  variable.  At  both  the  time  the  producer  selects  the  tar¬ 
get  performance  level  and  the  time  that  a  cost  expenditure  level  is  selected,  the  producer’s  view 
of  the  conditions  of  production  is  represented  by 

p  -  g(c,0), 

where  9  is  a  random  variable  which  has  the  same  density  function  at  both  of  the  times  of  pro¬ 
ducer  decision  making.  It  is  assumed  that  the  government  does  not  know  g  during  the  planning 
phase,  thus  justifying  the  selection  of  p  by  the  producer.  As  we  shall  see,  the  government’s 
information  about  downstream  cost  is  transmitted  to  the  producer  in  the  incentive  function. 
This  information  combined  with  the  producer’s  information  about  the  conditions  of  production 
yields,  via  profit  maximization,  the  best  solution  to  the  target  selection  problem. 

The  government  in  taking  a  social  view  is  interested  in  the  minimization  of 

E(c  +  D\{p,p)  +  F(p)). 

The  first  order  conditions  associated  with  this  minimization  are 
(D  1  +  E(bD\/bp)  ( bp/bc )  =  0 

(18)  E(bD\/bp)  +  bF(p)/bp  =  0. 

The  profit  function  given  to  the  producer  is  of  the  form 

(19)  it  =  G(p,p)  —  sc, 

where  s  again  represents  the  share  of  the  development  cost  borne  by  the  producer.  This  func¬ 
tion  has  the  same  basic  form  as  (16)  to  retain  compatibility  with  what  has  typically  been  used 
for  the  existing  U.S.  incentive  system.  The  first  order  conditions  which  apply  for  the  producer 
are 

(20)  E(bG/bp)  =  0, 

(21)  E(bG/bp)  ( bp/bc )  -  s. 

Comparing  (17)  and  (18)  with  (19)  and  (20)  shows  that  the  government  can  achieve  its  objec¬ 
tive  if  it  constructs  an  incentive  function  such  that 

(22)  Gfi  =  -  s(D\p  +  Fp), 

(23)  Gp  =  —  s(Dp). 

When  condition  (22)  is  satisfied,  the  incentive  profit  received  by  the  producer  from  a  change  in 
the  target  performance  level  is  just  equated  to  a  proportion  of  the  incremental  downstream  cost 
savings.  A  similar  interpretation  applies  to  (23).* 


'The  expectation  operators  are  not  required  in  (22)  or  (23)  because  the  terms  inside  the  expectation  operators  of  (20) 
and  (21)  are  substituted  for  the  terms  of  (17)  and  (18)  inside  these  operators. 
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Thus, 


where  A  is  a  constant. 


G(p,p)  =  -  sD(p,p)  +  A, 


By  linearizing  the  function  Dv  (p,p)  about  p,  and  F(p)  about  p ,  where  p  is  some  specified 
performance  level,  e  g.,  a  government  estimate  of  the  target  performance  level,  one  can  obtain 
a  formal  equivalence  of  the  U.S.  and  the  Soviet  incentive  models.  Thus,  if  one  approximates 
D\(p,p)  by 

K/s  +  a/s(p  -  p)  when  p  ^  p 
K/s  +  y/s(p  -  p)  when  p  <  p. 

and  F (p)  by 


M/s  +  p/s(p  -  p). 


then 


-  sD(p.p) 


-  (K  +  M)  +  a(p  -  p)  +  p(p  -  p)  :  p  >  p)\ 

-  (K  +  M)  +  y(p  -  p)  +  p (p  -p):p<  p). 


The  parameters  a,  y,  and  p  have  the  same  interpretation  as  in  the  new  Soviet  incentive  model. 
For  example,  in  that  p  is  a  preplanned  performance  outcome,  p  is  simply  proportional  to  the 
social  value  of  having  an  extra  unit  which  has  been  preplanned  and  can  be  identified  as  a  pro¬ 
portion  of  the  cost  savings  achieved  when  p  is  varied  during  the  planning  phase. 


If  inputs  are  fixed  as  assumed  by  Weitzman,  the  term  sc  vanishes  from  the  profit  function 
(19),  and  one  obtains  an  equivalence  to  the  new  Soviet  incentive  model.  If  the  inputs  are  vari¬ 
able  and  g(c,0)  applies,  then  one  obtains  an  equivalence  to  the  extended  Soviet  incentive 
model  developed  above.  The  relevant  maximization  problem  that  must  be  solved  by  the  pro¬ 
ducer  is  analogous  to  (9). 


Note  that  the  cost  share  s  is  the  factor  of  proportionality  that  applies  to  the  parameters  a, 
y,  and  p.  If  the  cost  share  changes,  then  so  too  will  the  parameters.  Thus,  there  appears  to  be 
a  degree  of  freedom  in  the  selection  of  these  parameters.  However,  this  factor  of  proportional¬ 
ity  has  distributional  significance  and,  in  fact,  is  related  to  the  distribution  of  societal  profits 
between  the  center  and  the  enterprise.  It  is  subject  to  optimization  in  an  analysis  of  risk  shar¬ 
ing  between  the  center  and  the  producer,  and  has  been  discussed  by  Hildebrandt  and  Tyson  [8] 
who  show  that  under  certain  types  of  differentiated  information  structures,  it  is  still  possible  to 
achieve  both  the  distributional  and  allocational  objectives  of  the  planner. 


CONCLUSIONS 


Decision  makers  in  both  the  United  States  and  the  Soviet  Union  face  similar  problems  of 
correctly  guiding  production  at  the  enterprise  level.  To  achieve  certain  social  objectives,  the 
United  States  government  has  employed  the  contractual  incentive  function  whereas  the  Soviet 
planners  have  used  the  non-contractual  or  unilarteral  incentive  function. 

The  new  Soviet  incentive  system  provides  an  incentive  for  the  enterprise  to  reveal  the 
socially  optimal  target  output  level.  My  analysis  has  shown  that  this  system  can  be  expanded  to 
deal  with  the  situation  when  the  enterprise  controls  the  amount  of  resources  utilized,  a  situa¬ 
tion  which  is  becoming  increasingly  typical  in  the  Soviet  Union  and  which  continues  to  be  the 
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norm  in  the  United  States.  Although  the  option  of  placing  an  incentive  on  the  target  output 
level  has  not  yet  been  used  in  the  United  States,  the  existing  U.S.  incentive  system  can  be 
expanded  to  permit  that  possibility. 
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ABSTRACT 

This  paper  studies  combat  between  two  homogeneous  forces  modelled  with 
variable-coefficient  Lanchester-type  equations  of  modern  warfare  with  support¬ 
ing  fires  not  subject  to  attrition.  It  shows  that  this  linear  differential-equation 
model  for  combat  with  supporting  fires  may  be  transformed  into  one  without 
the  supporting  fires  so  that  all  the  previous  results  for  variable-coefficient 
Lanchester-type  equations  of  modern  warfare  (without  supporting  fires)  may  be 
invoked.  Consequently,  new  important  results  for  representing  the  solution 
(i.e.  force  levels  as  functions  of  time)  in  terms  of  canonical  Lanchester  func¬ 
tions  and  also  for  predicting  force  annihilation  are  developed  for  this  model 
with  supporting  fires.  Important  insights  into  the  dynamics  of  combat  between 
two  homogeneous  forces  with  such  supporting  fires  are  discussed. 


1.  INTRODUCTION 

Today  military  operations  analysts  commonly  use  deterministic  Lanchester-typet  models 
for  developing  insights  into  the  dynamics  of  combat.  Militarily  realistic  computer-based 
Lanchester-type  models  of  quite  complex  military  systems  have  been  developed  for  almost  the 
entire  spectrum  of  combat  operations,  from  combat  between  battalion-sized  units  13],  [7]  to 
theater-level  operations  [5],  [6],  Nevertheless,  a  simple  combat  model  may  yield  a  clearer 
understanding  of  significant  interrelationships  that  are  difficult  to  perceive  in  a  more  complex 
model,  and  such  insights  can  subsequently  provide  valuable  guidance  for  more  detailed  compu¬ 
terized  investigations  (see  [2],  [18]).  In  this  paper  we  consider  such  a  simplified  variable- 
coefficient  Lanchester-type  model  of  combat  between  two  homogeneous  forces  with  supporting 
fires  not  subject  to  attrition,  and  develop  important  results  concerning  the  representation  and 
behavior  of  its  solution.  These  theoretical  results  are  shown  to  provide  important  insights  into 
the  dynamics  of  this  combat  situation. 

Thus,  the  model  that  we  study  is  important  because  it  yields  some  important  insights  into 
the  effects  of  supporting  fires  on  the  dynamics  of  combat  (see  [11],  [15],  and  [17]).  Our  work 
here  extends  and  unifies  previous  results  of  a  number  of  authors  [1],  [11],  and  [13]  through 

‘This  research  was  partially  supported  by  the  Office  of  Naval  Research  (both  through  direct  funding  and  also  through 
the  Foundation  Research  Program  at  the  Naval  Postgraduate  School)  and  partially  by  the  U.S.  Army  Research  Office, 
Durham,  North  Carolina,  under  R  &  D  Project  No.  1L161 102H57-05  Math  (funded  with  MIPR  No.  ARO  7-79). 
t  So-called  after  pioneering  work  by  F.  W.  Lanchester  110]. 
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(17).  Taylor  and  Parry  (17)  have  considered  the  same  model  and  developed  more-restrictive 
victory-prediction  conditions  for  fixed-force-ratio-breakpoint  battles  by  considering  the  force- 
ratio  equation  (see  also  (111).  Taylor  and  Brown  (14)  have  developed  a  mathematical  theory 
for  solving  variable-coefficient  Lanchester-type  equations  of  modern  warfare  (without  support¬ 
ing  fires)  and  introduced  canonical  hyperbolic-like  Lanchester  functions  for  constructing  their 
solution.  Taylor  and  Comstock  (16)  extended  this  work  by  developing  theoretical  results  for 
predicting  force  annihilation  from  initial  conditions  without  having  to  spend  the  time  and  effort 
to  explicitly  compute  force-level  trajectories  for  the  model  without  supporting  fires. 

In  the  paper  at  hand  we  show  that  the  variable-coefficient  model  with  supporting  fires  may 
be  transformed  into  the  one  without  them  so  that  all  the  known  results  about  the  latter  may  be 
applied:  representation  of  solution  in  terms  of  canonical  Lanchester  functions,  nonoscillation  of 
the  solution,  force-annihilation-prediction  conditions,  explicit  calculation  of  annihilation  time, 
etc.  We  consequently  can  translate  all  these  results  to  the  case  of  combat  with  supporting  fires. 
As  a  result  of  our  work  here,  the  theory  of  the  model  with  supporting  fires  may  be  considered 
to  be  as  complete  as  that  of  the  model  without  them.  Consequently,  one  can  now  study  this 
variable-coefficient  model  almost  as  easily  and  thoroughly  as  Lanchester’s  original  simple 
constant-coefficient  model  without  supporting  fires. 

2.  NOTATION 


The  symbols  that  are  used  in  this  paper  are  defined  as  follows. 


a,  b,  a,  fi 


constant  attrition-rate  coefficients. 


aft),bft),  =  /time-dependent  attrition-rate  coefficients,  the  first  two  are 
aft),  /3(t)  \taken  to  be  given  in  the  form  aft)  -  kagft)  and  bft)  -  kbhft). 


Aft),  Bft)  =  time-dependent  attrition-rate  coefficients  in  the  transformed 
model  (5.2);  given  by  (5.3), 


Cpft),  Spft) 


Cxft),  Sxft) 


ka,  kb 


hyperbolic-like  general  Lanchester  functions  fGLF)  which 
are  linearly-independent  solutions  to  the  transformed 
P  force-level  equation  (5.5);  they  are  analogous  to  the  hyperbolic 
cosine  and  hyperbolic  sine  respectively,  and  their  quotient  is  denoted  as 
Tpft)  -  Spft)/ Cpft)\CQft)  and  SqU)  are  similarly  defined, 

hyperbolic-like  GLF  which  are  linearly-independent  solutions 
to  the  X  force-level  equation  (4.3);  they  are  analogous  to  the  hyperbolic 
cosine  and  hyperbolic  sine  respectively,  and  their  quotient  is  denoted  as 
Tx(t)  -  Sx(r)/Cx(t);Cy(t)  and  Syft)  are  similarly  defined, 

positive  constants  ("scale"  factors)  used  for  the 
representation  of  aft)  and  bft), 


pft),  qft)  =  transformed  "force-level"  variables  corresponding  to  xU)  and  yft), 

respectively  and  related  to  them  by  (5.1);  with  initial  values  x0  and.y,,, 
p  -  dp/  dt  —  dp/ dt. 


Q*  —  parity-condition  parameter  for  the  model  without  supporting  fires  (4.1); 
defined  by  (4.7), 
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t  -  battle  time,  with  /  «  0  denoting  the  beginning  of  battle, 

f*  =  time  at  which  the  X  force  is  annihilated,  i.e.  xit%)  =  0, 

t0  =  largest  finite  time  at  which  ait)  or  bit)  ceases  to  be  defined, 

positive,  or  continuous;  we  take  r0  “  0  when  no  such  finite  time  exists. 


xit),  _v (r ) 


force  levels  of  X  and  Y  at  time  t  for  the  model  with  supporting 
fires  (3.1);  with  initial  values  x0  and_y0, 


Xit),  Yit) 


force  levels  of  X  and  Y  at  time  t  for  the  model  without 
supporting  fires  (4.1);  with  initial  values  Xq  andT'o, 


zit)  =  xit)/yit), 

X  /  =  \/ kakh  and  is  called  the  combat-intensity  parameter, 

kR  =  kjkh  and  is  called  the  relative-fire-effectiveness  parameter, 

A  *  =  parity-condition  parameter  for  the  model  with  supporting  fires  (3.1);  defined 

by  (6.4), 

0  =  yJ  ab  +  [(a  —  /3 )/ 2]^. 


3.  COMBAT  MODELLED  BY  VARIABLE-COEFFICIENT  LANCHESTER-TYPE 
EQUATIONS  OF  MODERN  WARFARE  WITH  SUPPORTING  FIRES 

We  consider  combat  modelled  by  the  following  Lanchester-type  equations 

=  -ait)y  -  /3(f)x  with  x(0)  =  x0, 
at 

(3.1)  dv 

-f  =  -bit)x  -  a(r)>»  with  .y(O)  =  y0. 
at 

where  t  =  0  denotes  the  time  at  which  the  battle  begins,  xit)  and  y  it)  the  numbers  of  X and  Y 
at  time  r,  and  ait),  bit),  ait),  and  /3(f)  denote  time-dependent  Lanchester  attrition-rate 
coefficients,  which  represent  the  effectiveness  of  each  side’s  fire.  In  any  analysis  of  combat,  we 
should  use  the  above  equations  only  for  x  and  y  >  0  and,  for  example,  set  dx/dt  =  0  when 
x  -  0,  since  negative  force  levels  have  no  physical  meaning.  However,  for  studying  the 
mathematical  properties  of  the  functions  defined  by  these  differential  equations,  we  will  find  it 
more  convenient  to  ignore  this  restriction  and  assume  that  (3.1)  holds  for  all  values  of  x  and  y. 

Two  situations  that  have  been  hypothesized  to  yield  the  above  equations  are: 

(51)  "aimed-fire"  combat  between  two  homogeneous  forces  with  "operational"  losses  (11,  [91, 

(52)  "aimed-fire”  combat  between  two  homogeneous  (primary)  forces  with  superimposed 
effects  of  supporting  fires  not  subject  to  attrition  [17]  (see  Figure  1). 

The  practical  use  of  such  equations  in  analysis  depends  on  one’s  ability  to  obtain  realistic  values 
for  the  coefficients:  the  prediction  of  the  attrition-rate  coefficients  from  weapon-system- 
performance  data  has  been  discussed  by  Bonder  and  Farrell  [21,  (see  also  [14l,  and  [171). 
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In  our  study  here  of  the  behavior  of  solutions  to  (3.1),  we  make  the  following  assump¬ 
tions  about  the  attrition-rate  coefficients: 

(Al)  ait)  and  bit)  are  defined,  positive,  and  continuous  for  /0  <  t  <  +  °°  with  fo  ^  0, 

( A2)  a  it)  and  0  (r)  ^  0  for  r0  <  t  <  +  °°  , 

(A3)  a(t),  bit), ait),  and/3 (r)  €  L(t0,T )  for  any  finite  T. 

J.r 

(  ait)  dt  exists  (and  is  given  by  a 
finite  quantity).  It  follows  that  (and  we  will  assume  so  below)  a(t)&  Li 0, +°°)  means  that 

J.r 

ait)dt  —  +  oo. 

o 

We  further  take  ait)  and  bit)  to  be  given  in  the  form  ait)  -  kagit)  and  bit)  -  kbhit), 
where  ka  and  kb  are  positive  constants  ("scale"  factors)  chosen  so  that  a  it) /bit)  =  ka/kb  when 
git)  -  hit)  for  all  t.  We  introduce  the  primary  weapon  systems’  combat-intensity  parameter  A , 
and  the  relative-fire-effectiveness  parameter  A  K  (for  the  primary  weapon  systems)  defined  by 

(3.2)  -  y/kakb,  and  A*  -  kjkb. 

Taylor  and  Parry t  [17]  noted  that  the  force  ratio,  z  -  x/y ,  satisfies  the  Riccati  equation 

(3.3)  -  bit)z2  +  [ait)  -  /3it))z  -  ait)  with  z(0)  -  z0  -  — , 

dt  y  o 

and  used  this  fact  to  develop  much  useful  information  about  the  behavior  and  implications  of 
the  model  (3.1).  For  the  model  (3.1)  with  a(r)  —  /3(r)  -0  for  all  t  >0  [i.e.  the  model 
without  supporting  fires  (4.1)  below],  Taylor  and  Comstock  [16]  have  developed  theoretical 
conditions  that  predict  force  annihilation  without  having  to  spend  the  time  and  effort  of  com¬ 
puting  the  entire  force-level  trruectories.  They  also  briefly  considered  (3.1)  and  (3.3)  and 
observed  that  for  identically  equal  fire  effectivenesses  of  the  supporting  weapons  [i.e. 
a(t)  -  /3(f)  for  all  t  ^  0]  the  same  Riccati  equation  is  satisfied  by  the  force  ratio  for  both  (3.1) 
and  (4.1).  Consequently,  in  terms  of  the  force  ratio,  a  battle’s  evolution  is  the  same  for  the 
two  models  (3.1)  and  (4.1),  although  the  force  levels  initially  decay  more  quickly  for  (3.1). 
Thus,  we  are  led  to  conjecture  that  there  must  be  some  kind  of  intimate  relationship  between 
the  two  models  with  and  without  supporting  fires. 

til  was  the  author’s  good  fortune  to  be  awarded  (jointly  with  S.  Parry)  the  1975  MAS  Prize  by  the  Military  Applica¬ 
tions  Section  of  the  Operations  Research  Society  of  America  for  the  three  papers  Taylor  and  Parry  1171  and  Taylor 
Ml.  1121. 
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In  this  paper,  we  will  show  that  the  model  with  supporting  fires  (3.1)  may  be  transformed 
into  the  one  without  supporting  fires  [i.e.  (4.1)1  and  use  this  fact  to  develop  a  fairly  thorough 
characterization  of  the  mathematical  nature  of  solutions  to  (3.1).  Our  results  allow  one,  in 
theory,  to  study  this  particular  variable-coefficient  model  (3.1)  almost  as  easily  and  thoroughly 
as  Lanchester’s  simple  constant-coefficient  one. 

4.  SUMMARY  OF  RESULTS  FOR  VARIABLE-COEFFICIENT  LANCHESTER-TYPE 
EQUATIONS  OF  MODERN  WARFARE  WITHOUT  SUPPORTING  FIRES 

In  this  section  we  summarize  results  about  the  model  without  supporting  fires  that  we  will 
use  in  our  study  of  the  model  with  supporting  fires.  These  results  have  appeared  in  a  scattered 
fashion  in  the  literature  (see  113],  [14],  and  [161),  and  we  summarize  them  here  in  unified 
form  for  the  reader’s  convenience.  Accordingly,  we  consider  the  following  variable-coefficient 
Lanchester-type  equations  of  modern  warfare  without  supporting  fires 

~  =  -  ait)Y  with  A'(0)  =  x0, 

(4.D  L 

- - bit) X  with  F(0)  =  >>o. 


Here  we  assume  that  the  attrition-rate  coefficients  ait)  and  bit)  satisfy  assumptions  (Al)  and 
(A3)  of  Section  3. 

Taylor  and  Brown  [14]  have  shown  that  the  X  (without  supporting  fires)  force  level  as  a 
function  of  time  may  be  written  as 

(4.2)  Xit)  =  *o(CY(0)Q(/)  -  M0)S*(r))  -WaT  ( CxiO)Sxit )  -  Sx(0)Cxir)}, 


where  the  hyperbolic-like  general  Lanchester  functions  (GLF)  Cxit )  and  Sx(l)  satisfy 

(4.3)  — fr  -«(')*(*)* -o. 

dt 2  ait)  dl  J  dt 

with  initial  conditions 


(4.4) 


Q('o)  =  1*  Sx(Iq)  —  0, 


1 

a(l0) 


0, 


_J_.  (,) _ L_ 

aU0)  dt  "0>  y/m' 


In  other  words,  these  functions  are  linearly  independent  solutions  to  the  X  force-level  equation 
(4.3)  and  are  analogous  to  the  ordinary  hyperbolic  functions  (see  [131  and  [141  for  further 
details).  We  will  refer  to  any  such  basic  pair  of  linearly-independent  solutions  (normalized  in 
an  appropriate  manner)  to  a  force-level  equation  like  (4.3)  as  general  Lanchester  functions 
(GLF).  Thus,  for  such  linear  Lanchester-type  differential  combat  equations,  the  hyperbolic-like 
GLF  play  in  variable-coefficient  combat  a  role  analogous  to  that  which  the  ordinary  hyperbolic 
functions  play  in  constant-coefficient  combat.  The  GLF  CYit)  and  SYit)  are  similarly  defined. 
The  reader  should  recall  (see  Section  3  above)  that  t0  denotes  the  largest  finite  time  at  which 
a(t)  and  bit)  ceases  to  be  defined,  positive,  or  continuous  (see  [141  and  [15]  for  further  dis¬ 
cussion).  Also,  we  will  set  tQ  -  0  if  no  such  finite  time  exists. 
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Thus,  the  hyperbolic-like  GLF  Cx,  Sx,  Cy,  and  Sy  are  basic  "building  blocks"  for  con¬ 
structing  the  solution  to  (4.1).  They  depend  on  only  the  attrition-rate  coefficients  aU)  and 
bU )  and  do  not  depend  on  the  initial  force  levels  x0  and  _y0  at  all.  Taylor  and  Brown  [14]  have 
given  examples  of  various  such  GLF  for  different  attrition-rate  coefficients  of  tactical  interest. 
Various  numerical  examples  are  also  given  in  their  work  (see  [14]  and  [15]).  We  observe  that 
(4.2)  simplifies  to 

(4.5)  X(t)  —  XgCX(t)  —  yo\/^~R  SXU), 

when  t0  —  0.  However,  Taylor  and  Brown  [14]  have  shown  that  (4.2)  simplifies  to  (4.5)  for 
to  ^  0  if  and  only  if  ait)/ bit)  =  constant  -  kj kb  for  all  t  (constant  ratio  of  attrition-rate 
coefficients).  In  other  words,  the  hyperbolic-like  GLF  only  possess  so-called  algebraic  addition 
theorems  like  the  hyperbolic  functions  do  only  for  a  constant  ratio  of  attrition-rate  coefficients. 

The  following  results  have  been  developed  elsewhere  in  the  literature.  They  form  the 
basis  for  our  subsequent  theoretical  analysis  of  the  model  (3.1). 

RESULT  4.1:  At  most  one  of  the  two  force  levels  X (t)  and  Y(t)  can  ever  vanish  in  finite 

time 


RESULTS  4.2:  If  either  ait)  &  Li 0,  +  °°)  or  bit)  S'  Li 0,  +  °°),  then  the  X  force 
(without  supporting  fires)  will  be  annihilated  in  finite  time  if  and  only  if 


(4.6) 


*o  /—  Q(0)  -  Q*SxiO) 
y  V  *  Q*CY(0)  -  SyiQ)  J’ 


where 

(4.7)  lim  {SA-(/)/Q(r)}  =  1  /Q*. 

t—+oo 

RESULT  4.3:  The  time  of  annihilation  of  the  X  force  (without  supporting  fires),  f/,  is  given 

by 

(4.8)  Tx  '({x'oC)'(O)  +  y’o  +  J'ia/^rGj-W))), 

where  Tx(t)  denotes  a  hyperbolic-like  GLF  that  is  analogous  to  the  hyperbolic  tangent  and  is  defined 
by  Tx(t)  =  SXU)/CXU). 

Results  4.2  (see  [13],  however,  for  the  proof  of  a  more  general  result)  and  4.3  were 
developed  by  Taylor  and  Comstock  [16],  although  Result  4.2  was  given  in  a  slightly  more  res¬ 
trictive  form  (namely,  both  a(t)  and  b(t)  GL( 0,  +«»)).  Let  us  now  sketch  the  proof  of 
Result  4.1  (see  also  [13]  for  a  less  detailed  proof),  which  says  that  all  solutions  to  (4.3)  are 
nonoscillatory  (see,  for  example,  Ince  [8,  p.  224]).  Multiplying  the  first  equation  of  (4.1)  by  Y, 
the  second  by  X ,  adding,  and  integrating  the  result  between  0  and  t,  we  obtain 

(4.9)  X(t)Y(t)  =  xoyo  -  J*0Ms)  +  Ms)*2(s)}</s. 

Considering  the  standard  uniqueness  theorem  for  linear  differential  equations  (see,  for  exam¬ 
ple,  Coddington  and  Levinson  [4,  p.  67]),  we  see  that  it  is  impossible  for  both  XU)  and  Y(t) 
to  be  equal  to  zero  at  any  finite  time,  since  they  then  would  have  to  be  equal  to  zero  for  all 
time.  Thus,  since  JCoy0  >  0  and  the  integral  in  (4.9)  is  strictly  increasing  in  t  and  positive  for 
/  >  0,  it  follows  that  X(t)Y(t)  has  at  most  one  finite  zero  for  t  ^  0,  and  Result  4.1  has  been 
proven. 
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Result  4.1  is  very  useful  in  the  theoretical  study  of  the  behavior  of  solutions  to  (4.1).  It 
shows  that  if  there  exists  a  finite  r/  such  that  XU f)  -  0,  then  YU )  >  0  for  all  t  >  0.  Thus,  if 
we  can  find  a  zero  for  XU),  we  know  that  YU)  is  guaranteed  to  be  positive  for  all  t.  This 
result  is  useful  in  proving  Result  4.2.  Furthermore,  when  continuous  withdrawals  for  both 
sides  (denoted  as  rU)  and  sit))  are  added  to  the  model  (4.1)  (e.g. 
dX/dt  =  —aU)Y  -  rit)  where  rit)  >  0),  then  Result  4.1  no  longer  holds  (at  least  when 
a(t),b(t),rU),  ands(/)  are  constants) . 


5.  TRANSFORMATION  OF  MODEL  WITH  SUPPORTING 
FIRES  TO  ONE  WITHOUT  THEM 


In  this  section  we  show  that  the  model  with  supporting  fires  (3.1)  may  be  transformed 
into  one  without  them  (4.1).  This  transformation  is  the  key  result  upon  which  this  paper  is 
essentially  based.  Thus,  the  substitution 


(5.1) 


p(t)  =  x(t)  exp 

[f.  P(s)ds 

q(t)  =  y(t)  exp 

{/0'a(s)</s| 

transforms  (3.1)  into 


(5.2) 


wnere 


(5.3) 


dt 

A  „  _ 

dt 


AU)q  with  p(Q)  *  xq, 
BU)p  with  <?  (0)  =  x0. 


A  U)  =  aU)  exp 


and 


(/o 

Bit)  =  bit)  exp  |  -  Jo  |/3(s)  -  a(s)Jrfsl. 


The  above  assumptions  (Al)  through  (A3)  of  Section  3  about  aU).  bit),  aU),  and/8(r) 
imply  that  A  it)  and  B(t)  have  the  same  properties  as  aU)  and  bit),  i.e.  they  € L (f0,  T)  for 
any  finite  T  and  are  defined,  positive,  and  continuous  for  to  <  t  <  -F°°.  Thus,  we  can  invoke 
all  the  results  for  the  model  without  supporting  fires  (4.1)  and  obtain  results  for  the  model  with 
supporting  fires  (3.1)  by  inversion  of  the  transformation  (5.1).  Before  we  do  this,  however,  let 
us  consider  the  representation  of  the  solution  to  (5.2). 


(5.4) 


The  transformed  "force-level"  variable  pit)  satisfies 
d2p  _  [  1  dA  [  dp  _ 


dr 


A  U)  dt  j  dt 


A  U)BU)p  “  0, 


which  may  be  written  in  the  equivalent  form 
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ls'5)  $ - {'J(,)  ~  + - aMbU)p  -  *• 
with  initial  conditions 

(5.6)  pi 0)  -  x0,  and  -^jj--^(O)  *  ~Xo> 

Hence,  by  the  results  of  Taylor  and  Brown  [14]  reported  in  the  previous  section  (i.e.  (4.2)),  the 
solution  to  (5.4)  that  satisfies  the  initial  conditions  (5.6)  may  be  written  as 

(5.7)  p(t)  -  x0{CQ(0)CPU)  -  SQiO)SPit)}  -  y<rfi^[Cp{0)SPU)  -  S>(0)C>(r)), 

where  the  hyperbolic-like  GLF  CPit)  and  SP(t)  are  linearly  independent  solutions  to  the  P 
force-level  equation  (5.5)  that  satisfy  the  initial  conditions 

(5.8)  CP(t  0)  =  1,  S„U0)  =  0. 

a  (/o)  dr  a  it0)  dr  0 

In  other  words,  C>(f)  and  S/>(r)  play  exactly  the  same  role  for  the  equation  (5.2)  that 
Cf(t)  and  Sj-(r)  do  for  (4.1).  We  have,  however,  expressed  the  P  force-level  equation  in 
terms  of  the  attrition-rate  coefficients  of  the  model  with  supporting  fires  (3.1).  We  note  again 
that  t0  denotes  the  largest  finite  time  at  which  a(t )  or  bit)  ceases  to  be  defined,  positive,  or 
continuous.  Also,  we  will  set  t0  =  0  if  no  such  finite  time  exists.  The  GLF  CQ(t)  and  SQ(r ) 
are  similarly  defined,  with  the  following  initial  condition  worthy  of  note: 

<59) 

6.  BEHAVIOR  OF  THE  MODEL  WITH  SUPPORTING  FIRES 

It  follows  from  (5.1),  (5.2),  and  the  results  of  Section  4  that  when  the  supporting  fues  aTe 
present  (i.e.  for  our  model  (3.1)  under  study),  the  X  force  level  as  a  function  of  time  is  given 
by 

(6  D  x{t)  -  U0{Co(0)C/»(t)  -  Sq(0)Sp(i)}  -  yosfa{CP(0)SP(t) 

— (0)  C>.  ( r ) }  ]  exp 


From  (5.1)  it  follows  that  for  any  finite  t 

(6.2)  pit)  -  0  if  and  only  if  x(t)  -  0,  and  git)  -  0  if  and  only  if  yit)  -  0, 

whence  Result  4.1  yields  Result  6.1. 

The  behavior  of  the  model  with  supporting  fires  is  then  largely  described  by  the  following 
results. 

RESULT  6.1:  At  most  one  of  the  two  force  levels  xit)  and  yit)  can  even  vanish  in  finite 

time. 

RESULT  6.2:  If  either  A(t)  $  L  (0,  +  «)  or  Bit)(H0,  +  «>),  then  the  X  force  (with  X 
force  (with  supporting  fires)  will  be  annihilated  in  finite  time  if  and  only  if 


(6.3) 

where 

(6.4) 


< 

.Vo 
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A  *0,(0)-  Sq(0)  V 


(6.5) 


lim  {5>(T)/0(r)}  =  1/A  *. 

/—  +  oo 

RESULT  6.3:  The  time  of  annihilation  of  the  X  force  (  with  supporting  fires) ,  tf  ,  is  given  by 

t*=  Tf'{{x0CQ{0)  +  yjrRSPmi\xQSQ(0)  +  yrjrRCpm), 


where  TP(t)  denotes  a  hyperbolic-like  GLF  that  is  analogous  to  the  hyperbolic  tangent  and  is  defined 
by  Tp(t)  -  Sp(t)/CP(t). 

RESULT  6.4:  Ifbotha(t)  and  b(t)V L{0,  +  oo),  then  either  Ait)  9 1(0,  +  oo)  or  B(t)QL 

(0,  +  oo). 

RESULT  6.5:  tfa(t)  *=-/3(r)  for  all  t^  [0,  4-  oo),  then 

(6.6)  X(t)  =  Ix0{Cy(0)Cjr(f)  -  S,(0)S*(/)}  -  W^{Q(0)S*(r) 

-  S*(0)Q(r)}]  exp|  -  f/3(s)ds 


and  when  either  a(t)4L(0,  +  °°  )or  b(t)4  L  (0,  +  oo),  then  the  X  force  (with  supporting  fires)  will 
be  annihilated  in  finite  time  if  and  only  if  (4. 6)  holds. 

RESULT  6.6:  If  a(s)  >  @(s)  for  a!!  s  €  [0,/]  with  inequality  holding  for  a  subinterval  of 
positive  length,  then 

x(t)  .  X(t) 


(6.7) 


tM>W>  for  ,>0- 


Results  6.1  through  6.5  follow  from  the  material  in  Sections  4  and  5.  Let  us  now  sketch 
the  proof  of  Result  6.6.  Equations  (4.3)  and  (5.4)  may  be  combined  to  yield 

\A&-£  I 


X[  A 


■  f0(B(s)  —  b(s)}p2ds  +  fQ[U A  (s)  —  \/a(s))p2ds 


(6.8)  +  f'-J  (pX-pX)2 

Jo  a(s)  X2  ’ 

which  is  known  as  the  Picone  formula  (e.g.  see  Ince  [8,  p.  226]).  Next,  we  observe  that  the 
initial  conditions  to  (4.3)  and  (5.4)  are  given  by 

f<01  -  JT(0)  -  and  ^  f  (0)  -  Js-f  (0)  - 


(6.9) 


a  (0)  dr 


Substituting  (6.9)  into  the  left-hand  side  of  (6.8)  and  observing  that  the  right-hand  side  of 
(6.8)  is  positive  for  /  >  0  because  a(s)  >  /3(s)  for  all  s  €  [0,r]  with  strict  inequality  holding 
on  a  subinterval  of  positive  length  implies  that  0  <  A(t)  <  a(r)  and  B(t)  >  b(r)  for  /  >  0 
we  find  that 


yw  <  fw  “  ho  exp[C  ^(s)  -  < 


x(t) 

y(t)’ 


and  Result  6.6  has  been  proved. 
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7.  ILLUSTRATION  OF  THEORY  WITH  RESULTS  FOR 
CONSTANT-COEFFICIENT  MODEL 


In  this  section  we  illustrate  the  above  general  mathematical  theory  of  the  Lanchester-type 
equations  (3.1)  for  the  special  case  in  which  all  the  attrition-rate  coefficients  are  constants. 
Hence,  we  consider 

(7.1)  - - ay  -  fix,  and  -  ~bx  -  ay, 

at  at 

where  a,  b,  a,  and  fi  denote  constants,  and  in  this  case  t0  =  0. 

Applying  the  key  transformation  (5.1)  of  Section  5,  we  find  that  the  transformed  force 
levels  pit)  and  qit)  satisfy  (5.2)  with 

(7.2)  Ait)  =  ae^~a)',  and  Bit)  =  beia  ~ 


and  consequently  (5.4)  reads 

—  03  —  -  ab  p  =  0. 

dt 2  dt 

We  then  find  that 


(7.3) 

(7.4) 


Cpit)  —  e,{P  a)/2  {cos h  Ot  + 


a  —  fi 


29 


SPit) 


=  -/(/3  -  a 


sin h  Ot}, 


)/  2 


sin/j  Ot, 


where  0  =  V ab  +  [(a  —  /3)/2]2.  It  follows  that 


_1_=  0  -  {(«  -  j3)/2] 
A*  Jab 


Hence,  (6.3)  yields  the  known  constant-coefficient  result  (see  Bach,  Dolansky,  and  Stubbs  [1] 
or  Taylor  and  Parry  (17])  that  the  X  force  will  be  annihilated  in  finite  time  if  and  only  if 


(7.5) 


^  <  -j-  0  - 

y0  b  2 


and  also  (6.1)  reads  (since  t0  =  0) 
(7.6) 


U)  =  xoC/’O)  -  y<njj  Spit)  e-* 


which  may  be  written  in  a  more  recognizable  form  as 


(7.7) 


x(t) 


x0  cos h  Ot  -  ^ 

V 


ayo  + 


j3  —  a 


*0 


sin/?  0t\ 


e~Ha  +  0)/2 


8.  DISCUSSION 

The  above  theoretical  results  provide  many  important  insights  into  the  dynamics  of  com¬ 
bat  between  two  homogeneous  forces  with  supporting  fires  not  subject  to  attrition,  and  also 
they  are  quite  useful  for  facilitating  parametric  analyses  (see  [14]  and  [15]).  For  example,  we 
observe  that  when  each  side’s  supporting  fires  are  always  equally  effective  (i.e.  ait)  =  0 it); 
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see  Result  6.5),  they  cancel  out  and  the  battle’s  outcome  in  a  fight-to-the-finish  is  the  same 
(although  the  victor  suffers  greater  losses)  as  when  they  are  not  present.  Strangely  enough, 
although  both  force  levels  decay  more  rapidly  in  the  early  stages  of  battle  (see  equation  (6.6)), 
both  the  length  of  battle  and  the  time  history  of  the  force  ratio  are  the  same  both  with  and 
without  the  supporting  fires. 

Moreover,  when  the  effectiveness  of  one  side’s  supporting  fires  always  dominates  that  of 
the  other  side  (e.g.  ait)  >  pit)  always;  see  Result  6.6),  then  the  force  ratio  is  always  more 
favorable  in  this  model  to  the  side  with  superiority  in  fire  support  than  it  is  in  the  correspond¬ 
ing  model  without  the  supporting  fires.  In  other  words,  gaining  superiority  in  fire  support  is  in 
some  sense  equivalent  to  an  improvement  in  the  force  ratio  of  primary  fighting  systems  (e.g. 
infantries).  Result  6.2  shows,  however,  that  supporting  fires  alone  cannot  win  a  battle  of  attri¬ 
tion  but  that  the  force  ratio  of  the  primary  fighting  systems  must  be  above  a  threshold  value 
(which,  however,  does  depend  on  the  net  effectiveness  of  the  supporting  systems;  see  equa¬ 
tions  (5.5)  and  (6.4))  in  order  for  a  side  to  win.  The  main  point  is  that  it  is  the  (cumulative) 
net  effectiveness  of  the  supporting  weapon  systems  that  modifies  the  relative  effectiveness  of  the 
primary  systems  in  order  to  determine  the  outcome  of  battle  (see  (5.2),  (5.3),  (6.3),  and  (6.4)). 

In  this  paper  we  have  generalized  the  results  of  Taylor  and  Brown  [14]  on  representing 
the  solution  in  terms  of  GLF  and  those  of  Taylor  and  Comstock  [16]  on  predicting  force 
annihilation  for  the  model  (3.1)  by  means  of  the  key  transformation  (5.1).  Such  a  close  rela¬ 
tionship  between  the  two  models  (3.1)  and  (4.1)  was  suggested  by  Taylor  and  Comstock’s 
observation  that  the  Riccati  equation  (e.g.  (3.3))  satisfied  by  the  force  ratio  is  the  same  for  both 
models  when  each  side’s  supporting  fires  are  always  equally  effective,  i.e.  ait)  =  /8(f).  More¬ 
over,  Result  6.5  means  that  all  the  series  solutions  and  GLF  developed  [14],  [15]  and  [16]  for 
the  model  (4.1)  without  supporting  fires  may  be  used  for  the  model  (3.1)  in  this  case.  Further¬ 
more  (as  observed  by  Taylor  and  Comstock’s  [16]),  the  force-annihilation-prediction  conditions 
are  the  same  for  these  two  models  in  this  special  case.  However,  we  have  given  more  general 
force-annihilation-prediction  conditions  (expressed  in  terms  of  special  transcendental  functions, 
i.e.  the  GLF),  which  apply  when  ait)  ^  /3(r)  and  are  complementary  to  Taylor  and  Parry’s 
[17]  conditions  (expressed  in  terms  of  elementary  functions  under  more  restrictive  conditions). 

In  summary,  we  have  presented  new  important  results  about  the  variable-coefficient 
model  (3.1)  for  Lanchester-type  combat  between  two  homogeneous  forces  with  supporting  fires 
not  subject  to  attrition  (see  Figure  1).  These  results  make  the  mathematical  theory  about 
representing  and  analyzing  the  model’s  behavior  just  as  complete  as  that  for  variable-coefficient 
Lanchester-type  equations  of  modern  warfare  without  supporting  fires.  In  many  cases  of  tacti¬ 
cal  interest  (see  the  time-dependent  attrition-rate  coefficients  considered  in  [13  through  16]), 
we  can  now  study  this  variable-coefficient  model  (even  with  supporting  fires)  almost  as  easily 
and  thoroughly  as  Lanchester’s  classic  constant-coefficient  one.  Also,  our  results  may  be 
viewed  as  results  in  the  theory  of  differential  equations,  and  then,  of  course,  they  apply  to  any 
other  system  that  can  be  modelled  with  (3.1)  (e.g.  reaction  kinetics  for  chemical  or  biological 
processes) . 

9.  CONCLUSIONS 

(I)  The  net  effectiveness  of  supporting  weapons  is  the  parameter  that  modifies  the 
relative  effectiveness  of  primary  systems  and  determines  (along  with  the  relative 
effectiveness  and  the  initial  force  ratio  of  the  primary  systems)  the  outcome  of 
combat  between  two  homogeneous  forces  with  supporting  fires  not  subject  to 
attrition. 
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(II)  Such  combat  with  supporting  fires  is  equivalent  to  combat  without  the  supporting 
fires,  only  the  relative  effectiveness  of  the  primary  systems  is  modified  by  the 
cumulative  net  effectiveness  of  the  supporting  systems. 


(Ill)  Supporting  weapons  augment,  but  do  not  replace,  primary  weapon  systems. 
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ABSTRACT 

A  collection  of  jobs  is  to  be  processed  by  a  single  machine.  Each  job  has  a 
cost  function  associated  with  it  which  may  be  either  linear  or  exponential,  costs 
accruing  wucn  a  job  is  completed  The  machine  may  be  allocated  to  the  jobs 
according  to  a  precedence  relation.  The  problem  is  to  find  a  strategy  for  allo¬ 
cating  the  machine  which  minimizes  the  total  cost  and  which  is  consistent  with 
toe  precedence  relation.  The  paper  extends  and  simplifies  some  previous  work 
done  by  SiJney 


1.  INTRODUCTION 

A  job-shop  consists  of  one  machine  and  a  set  J  -  {1,  ,  n)  of  jobs  to  be  processed  on 

it.  Associated  with  job  i  is  a  deterministic  processing  time  X,  and  a  cost  Junction  C,  (.)  such  that 
if  job  /  is  completed  at  time  F,  {flow  time)  its  cost  is  C^Fi).  We  usually  either  assume  that 

0)  C,(t)  =  —C,a',  1  <  /  <  n, 

for  positive  constants  C,  and  discount  rate  a.O  ^  a  <  1,  in  which  case  the  problem  is  said  to 
have  discounted  costs  or  we  assume  that 

(2)  C,(t)  -  C,t.  1  <  /  <  n. 

for  positive  constants  C,  in  which  case  the  costs  are  said  to  be  linear. 

A  precedence  relation  R  exists  on  J  such  that  if  (/  —>)€/?  then  job  i  must  precede  job  j 
in  any  feasible  permutation  of  J.  The  objective  is  to  find  those  permutations  a  of  J  which  are 

consistent  with  R  and  which  minimize  the  total  cost  TC  (a)  -  £C,  {/",(<*)},  Fja)  being  the 

completion  time  of  job  i  under  a. 

Sidney  [5]  considered  the  above  problem  with  linear  costs.  He  proposed  an  algorithm  that 
yields  all  the  optimal  permutations.  This  paper  modifies  and  extends  Sidney's  in  the  following 
ways: 


(i)  Some  new  results  for  stochastic  versions  of  this  problem  are  reported  in  Section  2. 

(ii)  Sections  3  to  6  contain  a  discussion  of  the  difficult  deterministic  problem  with 
discounted  costs  as  well  as  the  rather  easier  problem  with  linear  costs.  An  algorithm  is 
presented  in  Section  5  which  yields  optimal  permutations  for  all  of  these  problems.  The  proofs 
used  to  establish  the  validity  of  this  algorithm  are,  in  the  author's  opinion,  simpler  than  those 
used  by  Sidney. 


> 
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2.  STOCHASTIC  SCHEDULING 

As  Banerjee  111  points  out,  ‘in  real  life  the  time  taken  to  complete  a  job  on  a  machine  is 
invariably  random’.  In  the  light  of  this  assertion,  consider  the  following  stochastic  scheduling 
problem:  Let  job  set  J ,  precedence  relation  R  and  cost  functions  C,,  1  <  /  4  n,  be  as  in  Sec¬ 
tion  1.  The  processing  times  (A’,,  ....  X„\  are  independent  integer-valued  random  variables 
with  honest  distributions.  During  each  time  interval  [r,  /  +  1),  t  a  nonnegative  integer,  just 
one  of  the  unfinished  jobs  is  processed  on  the  machine. 

A  feasible  strategy  n  is  any  rule  for  allocating  the  machine  to  the  jobs 
with  R.  Under  strategy  jr,  job  /  is  completed  at  random  time  Ff  n).  The 

those  feasible  strategies  n  that  minimize  the  total  expected  cost  TC  (rr)  =  E 


which  is  consistent 
objective  is  to  find 

tc((F,(ir)l 


To  date,  progress  has  only  been  made  in  solving  such  stochastic  problems  where  the  con¬ 
straint  set  R  satisfies  fairly  stringent  requirements;  see,  for  example,  Glazebrook  [21  and  Meilij- 
son  and  Weiss  [4).  The  problem  with  general  R  seems  very  complex.  One  possible  way  for¬ 
ward  is  to  find  a  large  class  of  problems  for  which  the  optimal  strategy  can  be  shown  to  be 
nonpreemptive  (that  is,  given  by  a  permutation  of  the  jobs). 

The  following  result  holds  whether  the  cost  structure  is  discounted  or  linear. 

THEOREM  1:  If  the  distributions  of  the  processing  times  X,,  1  <  /  <  n,  have  nonde¬ 
creasing  hazard  rate  there  is  a  nonpreemptive  strategy  which  is  optimal  among  all  the  feasible 
strategies. 

This  result  is  a  consequence  of  Theorems  3  and  4  in  Glazebrook  and  Gittins  (3).  Note 
that  the  class  of  probability  distributions  covered  by  Theorem  1,  namely  those  with  non¬ 
decreasing  hazard  rates,  contains  a  large  number  of  examples  of  interest  including  the 
geometric  distribution.  Note  also  that  a  similar  result  to  Theorem  1  can  be  obtained  for 
continuous-time  problems  by  considering  appropriately  chosen  sequences  of  discrete-time  prob¬ 
lems. 


It  follows  from  Theorem  1  that  many  stochastic  scheduling  problems  are  formally 
equivalent  to  deterministic  ones  in  that  they  have  nonpreemptive  optimal  strategies.  Hence  the 
algorithm  in  Section  5,  although  given  in  the  context  of  deterministic  problems,  also  yields 
optimal  strategies  for  a  large  class  of  stochastic  problems. 

For  the  rest  of  the  paper  we  concentrate  exclusively  on  the  deterministic  case. 

3.  ORDERING  JOB  MODULES:  DISCOUNTED  COSTS 

Throughout  this  section  we  are  concerned  with  the  deterministic  problem  with  costs  given 
by  (1).  However,  before  we  can  state  and  prove  our’main  result  we  require  some  terminology 
and  notation. 

A  set  of  jobs  M  c  J,  to  be  denoted  by  1(1, J).  1  <  /  <  m,  1  <  j  <  «,),  forms  a  job 
module  if  and  only  if 

(i)  [(/,  j)  —  (/',  /)1  €  R,  1  <  /  <  m,  1  <  j  <  /  <  nr 

(ii)  ((/,  j)  —  (*,/)]«  fl.  1  </**<  m.  1  <  j  nk. 
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(iii)  If  a  €  J- M  then  precisely  one  of  (a),  (b)  or  (c)  holds: 


(a)  (a  —  (/,  j)]  €  R,  1  <  I  <  m,  1  <  j  <  n,\ 

(b)  [(/,  j)  —> •  a]  €  R,  1  <  i  <  m,  1  <  j  <  n,\ 

(c)  (la  —  (/.  j)],  ((/',  j)  —  all  n  R  —  d>.  1  <  /  <  m,  1  <  j  ^  n,. 


From  (i)  and  (ii),  the  jobs  in  M  form  disjoint  chains  ((/.  j),  1  <  j  <  n,),  1  <  /  <  m. 
Condition  (iii)  indicates  that  a  job  in  J-M  has  the  same  precedence  relation  to  every  job  in  M. 
Hence  a  job  in  J-M  will  either  precede  every  job  in  M  (iii(a))  or  be  preceded  by  every  job  in  M 
(iii(b))  or  have  no  relation  to  any  job  in  M  (iii(c)). 


and 

(3) 


To  simplify  the  algebraic  expressions  in  what  follows  we  introduce  the  notation  M,  =  a* 


CM[a\,  a2 . a.) 


IC, 

r-1 


n^, 

/-I 


II1 -_H' 


where  (a|.  a2 . ap )  is  an  ordered  subset  of  J  consistent  with  R.  The  function  CA/will  play 

an  important  part  in  the  analysis. 


The  following  two  Lemmas  which  are  both  easy  to  establish  are  used  in  the  proof  of 
Theorem  2. 


LEMMA  1:  Let  max  {CA/[a|,  a2, 

j-U\)p 

largest  such  maximizing  index,  then 

...  .  a,]}  - 

CM[a\, 

«2 . 

a  /•]  where  j*  is  the 

(4)  max  ( CM[ar ,  ar+1 . a,l)  > 

j-r(  l)p 

max  \CM[a 

/-id)/. 

1.  “2.  • 

. .  ,  a,]), 

1  <  r  <  j\ 

and 

(5)  max  [CM[ar,  a,+1 . a,])  < 

j-r(\)p 

max  (CA/la 

3-1(1)/. 

I-  «2.  • 

• .  .  a,]}, 

j*  <  r  <  p. 

LEMMA  2:  If  S|  -  (a|,  ....  a,,},  S2  -  (0i . Pq)  and  S3  =  (-y  j . y, }  are  mutu¬ 

ally  disjoint  ordered  subsets  of  J  consistent  with  R  such  that 

(i)  (S2,  S3)  “  (0i . Pq.  y\,  ...  ,  y,)  is  also  an  ordered  subset  of  J  consistent  with 

R, 


(ii)  CM  (a! . ap]  ^  CM[P\ . fjq],  CM  (ai . ap)  ^  CM[yx . yr]  and 

one  of  these  two  inequalities  is  strict,  then 

CM\ax . “/>!  >  CM[/3\ . /3q,  yi . yr). 


Theorem  2  tells  us  in  which  order  the  jobs  in  job  module  M  will  be  processed  in  an 
optimal  permutation  for  the  jobs  in  J. 


THEOREM  2:  There  is  an  optimal  permutation  for  the  jobs  in  J  such  that  the  following 
rule  is  adopted  for  processing  the  jobs  ((/.  j),  1  <  /  <  m.  1  <  j  ^  n,}  in  M:  If  at  some  time 
only  jobs  ((/,  j),  1  <  /  <  m,  l<  j  ^  s()  among  the  jobs  in  M  have  been  completed,  then 
the  next  job  in  M  to  be  processed  will  be  a  job  ( Ac ,  sk  -I-  1)  where  k  is  chosen  such  that  sk  <  nk 
and 


(6)  max  (CA/[(Ac, 

y-Cs^  +  l)  (I )nk 


sk+  1) . (Ac,  7 ) ) )  —  max 

h 


nax  |  mi 

5/<"/l  u-*5/+ 


max  (CA/((/.  s,  +  1), 


IHDii, 
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If  more  than  one  job  satisfies  these  criteria  it  is  immaterial  which  of  them  is  chosen  next. 

PROOF:  Those  jobs  from  {(/',  j),  1  <  j  <  n J  still  remaining  to  be  processed  at  any  time 
will  be  referred  to  as  chain  i.  If  at  some  time  chain  i  consists  of  the  jobs  ((/,  7),  s,  +  1 
<  7  <  «,}  it  has  associated  with  it  (see  (6))  the  quantity 

(7)  max  \CM[(i,  s,  +  1) . (/',  7)]} 

which  will  be  called  the  allocation  index  for  chain  i. 


The  theorem  is  proved  by  using  an  induction  on  the  number  of  jobs  in  M.  If  \M\  -  1 
the  result  is  trivially  true.  We  suppose  that  the  theorem  is  true  for  1  <  |A/|  <  n*but  not  for 
\M\  -  n*and  show  that  this  leads  to  a  contradiction.  Consider  a  problem  in  which  |A/|  -  n*. 


We  suppose  that  in  an  optimal  permutation  for  J  the  first  job  to  be  chosen  from  M  is 
(s.  1)  where  chain  1,  say,  has  the  largest  allocation  index  and  chain  s  does  not.  Hence,  from 
(7) 

max  (CA/[(s,  1) . (s,  7)1}  -  CMl(s.  1) . (s,j{s)) ]  <  CA/[(1,  1) . (1.  >{1))1 

>  —  l  <  1 )  n. 


(8) 


-  max  (0/1(1,  1) . (1,  »]) 

y-KD/i. 


where  7(1 1  and  7(s)  are  the  largest  such  maximizing  indices.  From  the  inductive  hypothesis, 
when  job  (s,  1)  has  been  completed  the  remaining  jobs  in  M  will  be  processed  in  the  order 
described  in  the  statement  of  the  theorem.  It  follows  from  Lemma  I  that  the  jobs  in  M  will  be 

processed  in  the  order  ((s,  1) .  (s,  Ms)),  (1,  1),  (1,7(1));  7")  for  some  Ms), 

1  <  MM  <  »s,  where  T  is  an  ordering  of  the  job  set  J  -  ((1,  /),  1  <  /  <  7(1);  (s,  /),  1  < 
/  <  Ms)).  Consider  now  the  following  three  cases  which  are  mutually  exclusive  and  exhaus¬ 
tive. 


The  first  set  of  jobs  in  M  to  be  allocated  service  before  being  interrupted  by  the  allocation 
of  the  machine  to  a  job  in  J-M : 

CASE  1  contains  ((s,  1) . (s,  Ms)),  (1,  1),  ,  (1,  7(1))}; 

CASE  2  is  ((s,  1),  ....  (s,  MM),  (1,  1),  ....  (1,  p))  for  some  p,  1  <  p  <  7(1); 

CASE  3  is  ((s,  1) . (s,  </))  for  some  q,  1  <  q  <  Ms). 

We  now  consider  case  2  in  detail. 

CASE  2:  Following  the  processing  of  jobs  {(s,  1),  ...  ,  (s,  Ms}),  (1.  1),  ....  (1,  p)} 
from  A/,  the  first  "visit"  is  made  to  J-M  (that  is,  the  first  visit  after  service  is  begun  on  A/). 
During  this  first  visit,  jobs  |(a,  v),  1  <  v  <  r,},  say,  are  processed  in  order  of  increasing  v. 
After  this  visit,  jobs  {(1,  p  +  1),  ...  ,  (1,  p(l))}  from  M  are  processed  where  p  <  p(l)  < 
7(1).  During  the  r  th  visit  to  J- M  jobs  ((a,  v),  <  v  <  tr]  are  processed  in  order  of 

increasing  v.  Assume  that  in  general,  after  the  r  th  visit  to  J-M ,  module  jobs 

((1,  p[r  -  1)  +  1) . (1,  p[r))}  are  processed  and  that  p[r*}  —  y(l).  The  fact  that  after 

(1,7(1))  has  been  processed  other  jobs  from  M  may  also  be  processed  before  the  (r*  +  l)5' 
visit  to  J- M  causes  no  problems. 

Depict  the  permutation  described  above  as  follows: 

({(s,  j),  1  <7  <  fc(s);  (1,7),  1  <7  <p)>  ((«,  v),  1  <  v  <  r,},  ((1,  j),  p  +  1  <7  <  p(l}},  U) 
where  U  is  an  appropriate  job  ordering.  Call  this  permutation  1. 
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Consider  now  the  following  two  permutations  which,  from  the  definition  of  a  job  module, 
are  also  consistent  with  R.  Permutation  2  is 

({(«.  v),  1  <  v  <  /,),  ((s,  j).  1  <  j  Ac { s } ;  (1,  j),  1  <  j  <  p{l )).  U). 

Permutation  3  is 

(|(s.  J),  1  ^  A  U);  0,  ;),  1  <£  j  <  />{1 )},  U«.  v).  1  <v<  /i),  U). 

It  is  easy  to  show  that  the  fact  that  permutation  2  gives  rise  to  at  least  as  high  a  cost  as 
does  permutation  1  implies  that 

(9)  CA/Ks,  1).  ....  (s,  A{s|),  (1,  1) . (1,  p) J  ^  CM[(a,  1) . (a,  /))}. 

Similarly,  the  fact  that  permutation  3  gives  rise  to  at  least  as  high  a  cost  as  does  permutation  1 
implies  that 

(10)  CA/Ka.  1) . (a,  f])l  ^  CA/Kl.  p  +  1) . (1.  p(l))l- 

From  (9)  and  (10)  it  follows  that 

(11)  CA/Ks,  1) . (s.  k\s)).  (1,  1) . (1,  p )]  >  CA/Kl ,  p  +  1) . U.  P 11 1)1 

Repeating  the  above  argument  for  all  visits  to  J- M  up  to  and  including  the  r*th  one,  the 
following  inequality  is  obtained: 

(12)  CM l(s.  1) . (s.  /els}).  (1.  1) . (1  ,p))  >  CA/Kl,  p(r*  —  1)  +  1) . (1,/(1))]. 

However,  it  follows  from  Lemma  1  that 

CA/Kl,  p(r*  -  1)  +  1) . (1.  J  111)]  >  CA/Kl,  1) . (1,  /(H)l 

(13) 

>  CA/Kl,  1) . (1,  p)]. 

It  follows  from  Lemma  1  and  the  fact  that  chain  1  has  a  larger  allocation  index  than  chain  s  that 
CA/Kl,  p(r'  -  1)  +  1) . (1,  ;{1))]  >  CA/Kl.  1) . d.  7(D)] 


>  CA/Ks.  1) . (s,  j(s\)  1  >  CA/Ks.  t) . (s,  AcUDl- 

It  now  follows  from  (13),  (14)  and  Lemma  2  that 

CA/Kl.  p(r *  -  1)  +  1) . (1,  y'llDl  >  CA/Ks,  1) . (s.  A-(s)).  (1,  1) . (1.  p) I. 

which  contradicts  (12). 

We  obtain  similar  contradictions  for  cases  1  and  3.  It  may  further  be  shown,  using  similar 
techniques,  that  in  the  event  of  more  than  one  unfinished  chain  satisfying  (6)  it  is  immaterial 
which  of  the  appropriate  jobs  is  chosen  next.  Hence  the  induction  proceeds  and  the  theorem  is 
proved. 

4.  ORDERING  JOB  MODULES:  LINEAR  COSTS 

Theorem  3  tells  us  in  which  order  the  jobs  in  job  module  A/  will  be  processed  in  an 
optimal  permutation  for  the  jobs  in  /  when  the  costs  are  linear  as  in  (2). 

THEOREM  3:  There  is  an  optimal  permutation  for  the  jobs  in  J  such  that  the  following 
rule  is  adopted  for  processing  the  jobs  ((/,  /),  1  ^  i  K  m.  1  ^  j  ^  n,1  'n  If  at  some  time 
only  jobs  {(/,  J).  1  <  /  <  m,  1  <  j  <  s,)  among  the  jobs  in  A/ have  been  completed,  then  the 
next  job  in  A/ to  be  processed  will  be  a  job  (A,  s*  +  1)  where  k  is  chosen  such  that  sk  <  nK  and 
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max 

y-lsj  +  lMDiij 


I  Q, 


t  Xkr 


max  max 

I /:*,<«,]  ./•(*,+ !)(!)«, 


I  x, 

r*.+l 


If  more  than  one  job  satisfies  these  criteria  it  is  immaterial  which  of  them  is  chosen  next.  This 
result,  previously  established  by  Sidney  15],  may  be  deduced  from  Theorem  2  by  considering 
the  limit  as  the  discount  rate  a  tends  to  one. 


5.  THE  ALGORITHM 

Suppose  the  jobs  ((/',  J),  1  <  /  <  m,  1  <  j  <  n,}  form  a  job  module  M  within  the  set  of 
jobs  J  with  precedence  relation  R.  A  chain  transformation  CM  acts  on  the  pair  ( J ,  R ) 
transforming  it  to  the  pair  ( J ,  R  U  S)  where 

(i) 

(ii)  with  respect  to  R  U  S  the  jobs  in  M  form  a  single  chain; 

(iii)  S  contains  only  elements  of  the  form  (a  —  fi)  a  €  M.  /i  €  M.  A  chain  transfor¬ 
mation,  then,  is  a  device  for  changing  job  modules  into  chains.  (J,  R)  is  chain 
reducible  if  it  can  be  transformed  to  a  pair  (J,  R  U  T)  by  successive  chain 
transformations,  where  with  respect  to  precedence  relation  RUT  the  jobs  in  J 
form  a  single  chain. 

An  optimal  permutation  for  any  chain  reducible  pair  ( J ,  R)  with  discounted  or  linear 
costs  may  be  obtained  by  successive  applications  of  whichever  is  appropriate  of  Theorem  2  or 
Theorem  3.  This  is  easily  seen  since  those  results  demonstrate  how  to  process  jobs  within  a  job 
module  in  an  optimal  fashion.  Such  a  module,  once  ordered  in  this  way,  effectively  becomes  a 
chain.  In  this  way,  using  these  theorems  is  seen  to  be  equivalent  to  applying  a  chain  transfor¬ 
mation.  If  by  successive  such  applications  of  the  theorems,  the  pair  (J,  R)  is  reduced  to  a  sin¬ 
gle  chain  of  jobs,  then  this  chain  must  evidently  correspond  to  an  optimal  permutation. 

However,  as  can  easily  be  shown,  not  all  pairs  (J,  R)  are  chain  reducible.  The  procedure 
for  solving  general  problems  that  is  described  in  this  section  utilizes  in  a  helpful  way  what  we 
know  about  ordering  chain  reducible  pairs.  Before  presenting  this  procedure  some  extra  termi¬ 
nology  is  required. 

Suppose*  that  U  is  a  subset  of  J  then  the  restriction  of  R  to  t/,  is  the  set 

R  n  ((/  — •  j),  i  €  U.  j  €  t/).  If  a  is  a  permutation  of  U  consistent  with  K|lMhen  a(U)  is 
the  ordered  set  obtained  by  the  application  of  a  to  U.  Further,  U  is  said  to  be  initial  if  there 
does  not  exist  /  €  J  -  U,  j  €  V  such  that  (i  — »  j)  €  R.  An  initial  permutation  a  is  a  permuta¬ 
tion  for  an  initial  subset  U. 

If  a  is  a  permutation  for  the  subset  (/,  then  /3  is  a  sub-ordering  of  a  if  and  only  if  /3  is  a 
permutation  for  some  subset  V  Q  U,  and  there  exists  a  permutation  for  U  —  V,  y  say,  such 
that  a(U)  -  |(0(  F),  y(U  -  F)}.  When  this  happens  we  write  a  -  (/3,  y). 

The  algorithm  for  solving  the  problem  associated  with  a  general  pair  ( J ,  R)  with  either 
discounted  or  linear  costs  is  as  follows: 

STEP  1:  Compute  all  the  initial  permutations  a(,  1  </<<?,  say,  with  the  following  pro¬ 
perties: 
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(i)  If  a,  is  a  permutation  for  initial  subset  U„  then  (J  —  Ult  R\J  -  U,)  is  chain  reduc¬ 
ible,  1  <  /  4  (fy 

(ii)  a,  is  minimal  in  the  sense  that  it  has  no  sub-ordering  which  satisfies  (i),  1  <  /  <  q. 

STEP  2:  Compute  an  optimal  permutation  /3,  for  the  pair  ( J  -  U,R\J  -  U,).  1  </<</. 
Since  these  pairs  are  all  chain  reducible  such  permutations  may  be  obtained  by  the  successive 
use  of  whichever  is  appropriate  of  Theorem  2  or  Theorem  3. 

STEP  3:  Select  that  permutation  for  J  from  among  the  {a,,  /3,),  1  <  /  <  q,  which  gives 
the  smallest  total  cost.  This  is  an  optimal  permutation. 


It  is  clear  that  this  procedure  does  yield  an  optimal  permutation.  Its  computational 

efficiency  in  any  problem  will  be  dependent  upon  the  nature  of  the  pair  ( J ,  R).  In  any  prob¬ 
lem  with  linear  deferral  costs  the  option  of  using  Sidney’s  algorithm  is  also  open  to  us. 

Sidney’s  algorithm  successively  chooses  from  J  initial  subsets  of  a  particular  kind  and  operates 

as  follows:  In  r  steps  of  the  algorithm  we  choose  subsets  U,,  1  <  /  <  r,  subject  to  specified 

7-1  7-i  ) 

J  -  U  U„  R\J  -  U  U,\ 


criteria  where  U ,  is  initial  in 
optimal  permutation  for  Uj,  then  (a. 


i-i  /-i 

t 

. a,)  is  optimal  for  [J,  /?}. 


and  where  U  U,  =  J.  If  a,  is  an 


Note  that  it  is  possible  to  use  Theorem  3  in  conjunction  with  Sidney's  algorithm  in  a 
problem  with  linear  costs.  Such  an  approach  is  exemplified  in  the  next  section. 


6.  AN  EXAMPLE 


The  example  discussed  here  is  the  one  given  by  Sidney  [5].  We  seek  an  optimal  permuta¬ 
tion  of  the  job  set  J  —  {1,  2,  3,  4,  5,  6,  7j.  Each  job  /  has  a  linear  cost  function  as  in  (2)  with 

C,  -  1.  The  processing  times  are  given  by  {Af!,  X2,  X2,  T4,  X5,  A"*,  X2]  —  (5,  8,  3,  5,  3,  7, 

6)  and  the  precedence  relation  /?  is  {  (1  —  3),  (1  —  4),  (1  —  6),  (1  —  7),  (2  —  4),  (2  — 1 ■  5), 

(2  -  6),  (2  -  7),  (3  -  7),  (4  -  6),  (4  -  7),  (5  -  6),  (5  -  7),  (6  -  7)). 

METHOD  1  (Glazebrook’s  algorithm):  Adopt  the  procedure  outlined  in  the  previous  sec¬ 
tion. 


STEP  1:  Initial  permutations  <*|,  a2  and  a3  are  chosen  together  with  initial  subsets 
Ui,  U2  and  (/3,  where  «,(£/,)  -  |1),  a2(U2)  -  {2,  1)  and  a3(f/3)  -  {2,  5). 

STEP  2:  Use  Theorem  3  to  compute  j3t,  j32  and  03.  Consider,  for  example,  the  computa¬ 
tion  of  /33,  an  optimal  permutation  for  (J  —  {2,  5);  Af  |y  —  (2,  5))  -  ({1,  3,  4,  6,  7);  {(1  —  3), 
(1  — *  4),  (I  — ►  6),  (1  — ►  7),  (3  —  7),  (4  —  6),  (4  —  7),  (6  — *  7)}).  Job  set  {3,  4,  6)  is  a  job 
module  in  this  new  problem  since  job  1  precedes  all  of  jobs  3,  4  and  6,  job  7  is  preceded  by 
them  all  and  (3)  and  (4,  6)  are  disjoint  chains.  Theorem  3  applied  to  this  new  problem  indi¬ 
cates  that  in  an  optimal  permutation  the  jobs  in  the  module  will  be  processed  in  the  order 
{3,  4,  6)  since 


1 

c3  ^ 

c4 

c4 

+  C6 

1 

3  “ 

y  ^  max 
*3 

V 

x< 

+  *6. 

“  5 

Hence  0 3(/ -  t/3)  -  {1,  3,  4.  6.  7).  Similarly  /3,(y  -  £/,)  -  |3.  2.  5,  4.  6,  7}  and 
p2U  -  U2)  -  |3,  5,  4,  6.  7). 

STEP  3:  Of  the  permutations  {a|,  /3\),  {a2,  /J2)  and  {«3,  /J3)  the  first  gives  rise  to  the 
smallest  total  cost.  Hence  an  optimal  permutation  for  (J.R)  is  given  by  {« , ( t/]) ,  /3 \(J  - 
£/,))  —  (1.  3.  2.  5.  4.  6.  7). 
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METHOD  2  (Sidney’s  algorithm):  According  to  the  criteria  which  Sidney  sets  out  in  [5), 
the  subsets  (/)  -  (1,  3|,  then  U2  =  (2,  4.  5)  and  then  U 3  =  (6,  7|  are  chosen.  ylt  y2  and  y3 
are  optimal  permutations  for  V\,  U2,  and  U3  respectively.  It  can  be  shown  that 
yi(f/,)  -  {1,  3|.  y2(t/2)  -  (2.  5.  4)  and  y3(U})  -  {6,  7)  so  that  an  optimal  permutation  for 
0.  R)  is  given  by  {1,  3,  2,  5,  4,  6,  7). 

METHOD  3  (Combination  of  Methods  1  and  2):  Use  Sidney’s  algorithm  to  choose  the 
initial  subset  U\  =  {1,  3)  and  note  that  its  optimal  permutation  y,  is  such  that  y i ( C/j)  =  {1,  3}. 
(/-(I,  3),  R\J-(\,  3])  or  ({2,  4,  5.  6,  7),  {(2-4),  (2-5),  (2  -  6),  (2  -  7),  (4-6), 
(4  —  7),  (5  —  6),  (5  —  7),  (6  —  7)1)  is  chain  reducible  with  an  optimal  permutation  5  which 
may  be  computed  by  using  Theorem  3.  It  is  found  that  8(J  -  {1,  3})  =  (2,  5,  4,  6,  7}  and  so 
an  optimal  permutation  for  (/,  R)  is  that  implied  by  the  ordered  set  (1,  3,  2,  5,  4,  6,  7). 
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ABSTRACT 

In  this  paper  we  precisely  define  the  iwo  types  of  simulations  (terminating 
and  steady-state)  with  regard  to  analysis  of  simulation  output  and  discuss  some 
common  measures  of  performance  for  each  type.  In  addition,  we  conclude,  on 
the  basis  of  discussions  with  many  simulation  practitioners,  that  both  types  of 
simulations  are  important  in  practice.  This  is  contrary  to  the  impression  one 
gets  from  reading  the  simulation  literature,  where  the  steady-state  case  is  al¬ 
most  exclusively  considered. 

Although  analyses  of  terminating  simulations  are  considerably  easier  than 
are  those  of  steady-state  simulations,  they  have  not  received  a  careful  treat¬ 
ment  in  the  literature.  We  discuss  and  give  empirical  results  for  fixed  sample 
size,  relative  width,  and  absolute  width  procedures  that  can  be  used  for  con¬ 
structing  confidence  intervals  for  measures  of  performance  in  the  terminating 
case. 


1.  TYPES  OF  SIMULATIONS  WITH  REGARD  TO  ANALYSIS  OF  THE  OUTPUT 

We  begin  by  giving  a  precise  definition  of  the  two  types  of  simulations  with  regard  to 
analysis  of  the  output  data  (cf.  Gafarian  and  Ancker  [4]  and  Kleijnen  [6]).  A  terminating  simu¬ 
lation  is  one  for  which  the  quantities  of  interest  are  defined  relative  to  the  interval  of  simulated 
time  [0,r£],  where  7) ,  a  possibly  degenerate  random  variable  (r.v.),  is  the  time  that  a  specified 
event  £  occurs.  The  following  are  some  examples  of  terminating  simulations: 

a)  Consider  a  retail  establishment  (e  g.,  a  bank)  which  closes  each  evening  (physically  ter¬ 
minating).  If  the  establishment  is  open  from  9  to  5,  then  the  objective  of  a  simulation 
might  be  to  estimate  some  measure  of  the  quality  of  customer  service  over  the  period 
beginning  at  9  and  ending  when  the  last  customer  who  entered  before  the  doors  closed  at 
5  has  been  served.  In  this  case  £  =  {at  least  8  hours  of  simulated  time  have  elapsed  and 
the  system  is  empty),  and  reasonable  initial  conditions  for  the  simulation  might  be  that  no 
customers  are  present  at  time  0. 

b)  Consider  a  telephone  exchange  which  is  always  open  (physically  nonterminating).  The 
objective  of  a  simulation  might  be  to  determine  the  number  of  (permanent)  telephone 
lines  needed  to  service  adequately  incoming  calls.  Since  the  arrival  rate  of  calls  changes 
with  the  time  of  day,  day  of  the  week,  etc.,  it  is  unlikely  that  a  steady-state  measure  of 
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performance  (see  Section  2),  which  is  defined  as  a  limit  as  time  goes  to  infinity,  will  exist. 
A  common  objective  in  this  case  is  to  study  the  system  during  the  period  of  peak  loading, 
say,  of  length  t  hours,  since  the  number  of  lines  sufficient  for  this  period  will  also  do  for 
the  rest  of  the  day.  In  this  case,  £  =  [t  hours  of  simulated  time  have  elapsed).  However, 
care  must  be  taken  in  choosing  the  number  of  waiting  calls  at  time  0,  since  the  actual  sys¬ 
tem  will  probably  be  quite  congested  at  the  beginning  of  the  period  of  peak  loading.  One 
approach  would  be  to  collect  data  from  the  actual  system  on  the  distribution  of  the 
number  of  waiting  calls  at  the  beginning  of  the  peak  period.  Then  for  each  simulation  run 
of  length  t  hours,  a  sample  is  generated  from  this  distribution  and  used  as  the  number  of 
waiting  calls  at  time  0. 

c)  Consider  a  military  confrontation  between  a  defensive  (fixed  position)  blue  force  and  an 
offensive  (attacking)  red  force.  Relative  to  some  initial  force  strengths,  the  objective  of  a 
simulation  might  be  to  estimate  some  function  of  the  (final)  force  strengths  at  the  time 
that  the  red  force  moves  to  within  a  certain  specified  distance  from  the  blue  force.  In  this 
case,  £  =  (red  force  has  moved  to  within  a  certain  specified  distance  from  the  blue  force). 

A  steady-state  simulation  is  one  for  which  the  quantities  of  interest  are  defined  as  limits  as 
the  length  of  the  simulation  goes  to  infinity.  Since  there  is  no  natural  event  to  terminate  the 
simulation,  the  length  of  the  simulation  is  made  large  enough  to  get  "good"  estimates  of  the 
quantities  of  interest.  Alternatively,  the  length  of  the  simulation  could  be  determined  by  cost 
considerations;  however,  this  may  not  produce  acceptable  results.  The  following  are  some 
examples  of  steady-state  simulations: 

a)  Consider  a  computer  manufacturer  who  constructs  a  simulation  model  of  a  proposed  com¬ 
puter  system.  Rather  than  use  data  from  the  arrival  process  of  an  existing  computer  sys¬ 
tem  as  input  to  the  model,  he  typically  assumes  that  jobs  arrive  in  accordance  with  a  Pois¬ 
son  process  with  rate  equal  to  the  predicted  arrival  rate  of  jobs  during  the  period  of  peak 
loading.  He  is  interested  in  estimating  the  response  time  of  a  job  after  the  system  has 
been  running  long  enough  so  that  initial  conditions  (e.g.,  the  number  of  jobs  in  the  sys¬ 
tem  at  time  0)  no  longer  have  any  effect.  (Assuming  that  the  arrival  rate  is  constant  over 
time  allows  steady-state  measures  to  exist.) 

b)  A  chemical  manufacturer  constructs  a  simulation  model  of  a  proposed  chemical  process 
operation.  The  process,  when  in  operation,  will  be  subject  to  randomly  occurring  break¬ 
downs.  The  input  rate  of  raw  materials  to  the  process  and  the  controllable  parameters  of 
the  process  are  both  assumed  to  be  stationary  with  respect  to  time.  The  company  would 
like  to  estimate  the  production  rate  after  the  process  has  been  running  long  enough  so 
that  initial  conditions  no  longer  have  any  effect. 

The  remainder  of  this  paper  is  organized  as  follows.  In  Section  2  we  discuss  some  com¬ 
mon  measures  of  performance  for  both  types  of  simulations  and  in  Section  3  we  present  our 
findings  on  the  relative  occurrence  of  each  type  in  the  real  world.  A  number  of  procedures 
which  can  be  used  to  construct  confidence  intervals  for  terminating  simulations  are  discussed  in 
Section  4  and,  finally  in  Section  5  we  summarize  our  findings. 

2.  MEASURES  OF  SYSTEM  PERFORMANCE 

To  the  best  of  our  knowledge,  nowhere  in  the  simulation  literature  are  measures  of  per¬ 
formance  for  terminating  simulations  explicitly  defined.  In  this  section  we  define  and  contrast 
several  common  measures  of  performance  for  terminating  and  steady-state  simulations  by 
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means  of  examples.  (Because  of  the  diversity  of  terminating  simulations,  it  is  not  possible  to 
give  one  definition  that  fits  all  cases.)  For  the  simple  examples  that  we  consider,  it  is  possible  to 
compute  analytically  measures  of  performance.  This  fact  will  be  useful  in  Section  4  where  we  study 
stopping  rules  for  terminating  simulations. 


A.  Averages 


Consider  first  the  stochastic  process  {£>,.  ‘  >  1}  for  the  M/M/1  queue  with  p  <  1,  where 
D,  is  the  delay  in  queue  of  the  i  th  customer.  The  objective  of  a  terminating  simulation  of  the 
M/M/1  queue  might  be  to  estimate  the  expected  average  delay  of  the  first  m  customers  given 
some  initial  conditions,  say,  that  the  number  of  customers  in  the  system  at  time  0,  MO),  is 
zero.  The  desired  quantity,  which  we  denote  by  d(m  I  MO)  =  0),  is  then  given  by 


rf(m|/V(0)  =  0)  =  E 


£  Di/m\N(0)  =  ol 

/-l 


m 

=  £  £[Z>,|/V(0)  =  0]/ m. 

/-I 

(Alternatively,  if  one  is  interested  in  estimating  the  expected  average  delay  of  all  customers 
who  arrive  and  complete  their  delay  in  the  time  interval  [0,  r],  then  the  desired  quantity  is 
given  by 


d(t\N(0)  -  0)  -  E  pf  A/^(')lA'(0)=0| 

where  M(t)  (a  r.v.)  is  the  number  of  customers  who  arrive  and  complete  their  delay  in  the 
interval  (0,  /].  Note  that  in  this  case  the  expectation  and  summation  are  not  interchangeable. 
Thus,  the  label  "expected  average  delay"  is  more  general  than  "average  expected  delay.”)  Note 
also  that  the  quantity  rf(m|jV(0)  =  0),  which  is  often  called  a  transient  characteristic  of  the  sto¬ 
chastic  process  {Dhi  >  1),  explicitly  depends  on  the  state  of  the  system  at  time  0;  i.e., 
dim |/V(0)  =  /)  ^  </(/w|A(0)  =  j)  for  i  ^  j. 


The  objective  of  a  steady-state  simulation  of  {/),,/  ^  1}  for  the  M/M/1  queue  would  be  to 
estimate  the  steady-state  expected  average  delay  d,  which  is  given  by 

d  =  lim  d(m\N(Q)  =  i)  for  any  i  =  0, 1,  . . .  . 

m— >  oo 

(Under  our  assumption  that  p  <  1,  it  can  be  shown  that  Sexists.)  Observe  that  d  is  indepen¬ 
dent  of  iV(0).  In  Figure  1  we  plot  d(m\N(0)  =  0)  as  a  function  of  m.  (The  arrival  rate  A  =  1 
and  the  service  ratep  =  10/9,  sop  =  .9.)  The  horizontal  line  that  </(w|Ar(0)=0)  asymptoti¬ 
cally  approaches  is  at  height  d  =  8.1. 


As  a  second  example  consider  the  stochastic  process  {£,,/  >  1}  for  an  (s,  S)  inventory 
system  with  zero  delivery  lag  and  backlogging,  where  £,  is  the  expenditure  in  the  i  th  period. 
This  system  is  described  in  detail  in  Law  [8].  A  possible  objective  of  a  terminating  simulation 
would  be  to  estimate  the  expected  average  cost  for  the  first  m  periods  given  that  the  inventory 
level  at  the  beginning  of  period  1,  /,,  is  S  : 


e(m  |/i 


S) 


E 


i- 1 


S  . 


The  objective  of  a  steady-state  simulation  of  {£,,/  >  1)  would  be  to  estimate  the  steady-state 
expected  average  cost: 


e  -  lim  e{m\l\  -  /)  for  any  i  =  0,  ±1,  ±2 . 
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(It  can  be  shown  that  e  exists.)  We  plot  e(m\I\  —  S)  as  a  function  of  m  and  also  e  —  112.108 
in  Figure  2. 


6  12  18 
Figure  2.  <"(ml/|-Sl  as  a  function  of  m  for  Hie  ( s.S)  inventory  system 

Our  third  example  is  quite  different  from  the  first  two.  A  reliability  model  consisting  of 
three  components  will  function  as  long  as  component  1  works  and  either  component  2  or  3 
works.  If  T  is  the  time  to  failure  of  the  whole  system  and  7)  the  time  to  failure  of  component 
i(i  -  1, 2, 3),  then 

T  -  min[7j,  max(r2,  7j)l. 

We  further  assume  that  the  T's  are  independent  r.v.’s  and  each  7]  has  a  Weibull  distribution 
with  shape  parameter  a  =  .5  and  scale  parameter  /3  “  1.  (A  distributional  assumption  for  the 
T('s  is  needed  in  Section  4  where  we  present  simulation  results  for  this  model.)  The  objective 
of  a  terminating  simulation  might  be  to  estimate  the  expected  time  to  failure  of  the  system  given 
that  all  components  are  new,  £(7'|all  components  are  new).  If  we  assume  that  the  system  is 
not  repaired  when  it  fails,  then  a  steady-state  simulation  makes  no  sense  for  this  system.  Such 
could  be  the  case  if  this  system  were  part  of  a  space  probe. 
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B.  Proportions 


The  usual  criterion  for  comparing  two  or  more  systems  is  some  sort  of  average  behavior. 
However,  different  kinds  of  information  may  be  of  more  value  in  some  situations.  For  exam¬ 
ple,  a  bank  manager  may  be  concerned  with  estimating  the  proportion  of  customers  who  experi¬ 
ence  a  delay  in  excess  of  5  minutes.  Since  proportions  are  really  just  a  special  case  of  averages, 
we  will  illustrate  them  by  means  of  the  M/M/1  example. 


In  a  terminating  simulation  of  the  M/M/1  queue  the  objective  might  be  to  estimate, 
instead  of  an  expected  average  delay,  the  expected  proportion  of  the  first  m  customers  whose  delay 
is  less  than  or  equal  to  x  (a  specified  number)  given  that  N(0)  =  0.  Denote  the  desired  quan¬ 
tity  by  P(m,  .V  |  /V (0)  =  0)  and  let 


Y,(x)  = 

Then  P(m,  .v|A(0)  =  0)  is  given  by 

P(m,  x\N(0)  =  0)  =  E 


1  if  D,  <  x 

0  if  D,  >xfor/=1-2 . 


£  Yj(x)/m\N(0)  =  0 

i- 1 


For  a  steady-state  simulation,  the  objective  would  be  to  estimate  the  steady-state  expected  propor¬ 
tion  of  customers  whose  delay  is  less  than  or  equal  to  x  : 

P(x)  =  lim  Pirn,  x|jV(0)  =  /)  for  any  i  =  0, 1 . 

m— oo 


3.  RELATIVE  IMPORTANCE  OF  EACH  TYPE  OF  SIMULATION 

Reading  the  simulation  literature  leads  one  to  think  that  only  steady-state  simulations  are 
important;  almost  every  paper  written  on  the  analysis  of  simulation  output  data  deals  with  the 
steady-state  case.  This  may  be  a  carry-over  from  mathematical  queueing  theory  where  only  a 
.  teady-state  analysis  is  generally  possible.  However,  we  believe  that  terminating  simulations  are 
atso  important.  We  have  discovered,  by  talking  to  a  large  number  of  simulation  practitioners, 
that  a  significant  proportion  of  simulations  in  the  real  world  are  actually  of  the  terminating  type. 
The  following  are  some  reasons  why  a  steady-state  analysis  may  not  be  appropriate: 

a)  The  system  under  consideration  is  physically  terminating.  In  this  case,  letting  the  length 
of  a  simulation  be  arbitrarily  large  makes  no  sense. 

b)  The  input  distributions  for  the  system  change  over  time.  In  this  case,  steady-state  meas¬ 
ures  of  performance  will  probably  not  exist. 

c)  One  is  often  interested  in  studying  the  transient  behavior  of  a  system  even  if  steady-state 
measures  of  performance  exist. 

4.  STOPPING  RULES  FOR  TERMINATING  SIMULATIONS 

In  the  following  three  subsections  we  consider  procedures  that  can  be  used  to  construct 
confidence  intervals  (c.i.’s)  for  measures  of  performance  for  terminating  simulations.  We  will 
not  consider  the  steady-state  case  since  it  has  been  widely  discussed  in  the  simulation  literature. 
For  surveys  of  fixed  sample  size  and  sequential  procedures  that  can  be  used  to  construct  c.i.'s 
for  steady-state  measures  of  performance,  see  Law  and  Kelton  [9,  10).  The  random  numbers 
used  in  the  remainder  of  this  paper  were  generated  from  the  generator  discussed  in  [8], 
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A.  Fixed  Sample  Size  Procedures 

Suppose  we  make  n  independent  replications  of  a  terminating  simulation.  The  indepen¬ 
dence  among  replications  is  accomplished  by  using  different  random  numbers  for  each  replica¬ 
tion  and  by  starting  each  one  with  the  same  initial  conditions.  If  X,  is  the  estimator  of  interest 
from  the  »'  th  replication  ( r  —  1.2,  ....  «),  then  the  Xt's  are  independent  identically  distributed 

m 

(i.i.d.)  r.v’s.  (For  the  M/M/1  queue,  X,  might  be  the  average  £  DJ m  or  the  proportion 

7-1 

£  Yj(x)/m.)  If,  in  addition,  the  X,'s  are  normally  distributed,  then  a  100(1  -a) %  (0  <  a  <  1) 

7-1 

c.i.  for  n  -  E(X)  is  given  by 

(1)  X(n)  ±  r„- i.i-a/2  y/s2(n)/n, 

where  X(n)  and  s2(n)  are  the  usual  sample  mean  and  variance,  respectively,  and  is 

the  1  -  a/ 2  point  for  a  /  distribution  with  n- 1  degrees  of  freedom. 

In  practice  the  X's  will  not  be  normally  distributed  and  the  c.i.  given  by  (1)  will  be  only 
approximate.  To  investigate  the  effect  of  nonnormality,  we  simulated  the  three  stochastic 
models  of  Section  2.  For  the  M/M/1  queue  with  p  -  .9,  the  (s,S)  inventory  system,  and  the 
reliability  model,  respectively,  the  quantities  of  interest  were  d(25| A(0)  -  0)  -  2.12, 
e(12|/i  -  S)  —  99.52,  and  £(T|all  components  are  new)  —  .778.  (See  18]  for  a  discussion  of 
how  to  compute  the  first  two  quantities.)  For  each  model  we  performed  500  independent  simu¬ 
lation  experiments,  for  each  experiment  we  considered  n  —  5,  10,  20,  40,  and  for  each  n  we 
used  (1)  to  construct  a  90%  c.i.  for  the  desired  quantity.  In  Tables  1,  2,  and  3  we  give  the  pro¬ 
portion,  p,  of  the  500  c.i.’s  that  covered  the  desired  quantity,  a  90%  c.i.  for  the  true  coverage, 
and  the  average  value  of  the  c.i.  half  length  divided  by  the  point  estimate  over  the  500  experi¬ 
ments  for  the  three  models.  The  90%  c.i.  for  the  true  coverage  was  computed  from 

p  ±  1.645  Vp(l-p)/500. 

Observe  that  for  the  M/M/1  queue  and  the  (s,S)  inventory  system  the  coverages  are  quite 
close  to  90%,  but  for  the  reliability  model  there  is  a  significant  degradation  in  coverage, 
apparently  caused  by  a  severe  departure  from  normality.  To  see  if  this  is  indeed  the  case,  we 
generated  1000  X,'s  for  each  stochastic  model  and  estimated  the  skewness  and  kurtosis.  These 
estimates,  which  are  presented  in  Table  4,  indicate  that  the  Xfs  for  the  reliability  model  are 
considerably  more  nonnormal  than  are  those  for  the  other  two  models.  This  conclusion  was 
reinforced  by  plotting  histograms  for  the  three  sets  of  data. 


T ABLE  1 .  Fixed  Sample  Size  Results 
for  d(25)\N(0)=0)  =  2.12, 
M/M/1  Queue  with  p  =  .  9 


n 

coverage 

r  c.i.  half  length 

average  of - = - 6 — 

X(n) 

5 

.880  ±  .024 

.672 

10 

.864  ±  .025 

.436 

20 

.886  ±  .023 

.301 

40 

.914  ±  .021 

.212 
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TABLE  2.  Fixed  Sample  Size  Results  for 
e(l2\f  —  S)—  99.52,  (s,S)  Inventory  System 


n 

coverage 

r  c.i.  half  length 

average  of - = - 6 — 

X(n ) 

5 

.908  ±  .021 

.048 

10 

.904  ±  .022 

.031 

20 

.880  ±  .024 

.021 

40 

.894  ±  .023 

.014 

TABLE  3.  Fixed  Sample  Size  Results  for 
E(T\all  components  new)=  .7 78, 
Reliability  Model 


n 

coverage 

_  <■  c.i.  half  length 

average  of - = - B — 

X(n) 

5 

.708  ±  .033 

1.163 

10 

.750  ±  .032 

.820 

20 

-800  ±  .029 

.600 

40 

.840  ±  .027 

.444 

TABLE  4.  Skewness  and  Kurtosis  for  the  Three 
Stochastic  Models  and  the  Normal  Distribution 


Stochastic  Model 
or  Distribution 

Skewness 

Kurtosis 

Normal  Distribution 

0* 

3* 

M/M/1  Queue 

1.66 

6.43 

(s,S)  Inventory  System 

.45 

3.76 

Reliability  Model 

5.18 

54.39 

‘Theoretical  Values 


B.  Relative  Width  Procedures 

One  disadvantage  of  the  fixed  sample  size  approach  to  constructing  a  c.i.  is  that  the  simu¬ 
lator  has  no  control  over  the  c.i.  half  length;  for  fixed  n,  the  half  length  will  depend  on  the 
population  variance  o-2=  Var(X).  In  this  subsection  we  consider  two  sequential  procedures 
which  allow  one  to  specify  the  "relative  precision"  of  a  c.i.  Both  assume  that  X\,  X2  ...  is  a 
sequence  of  i.i.d.  r.v.’s  which  need  not  be  normal. 

The  first  procedure  has  been  suggested  for  use  in  several  different  contexts;  see  Iglehart 
[5],  Lavenberg  and  Sauer  [7],  and  Thomas  [13].  The  objective  of  the  procedure  is  to  construct 
a  100(1 -a)%  c.i.  for  /*_such  that  the  difference  between  the  point  estimator  X(n)  and  p  is  no 
more  than  100  y%  of  X(n),  that  is, 

[2)  \X(n)  —  <  y|^(w)|  for  0  <  y  <  1. 
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Choose  an  initial  sample  size  n0  ^  2,  let 

8,  ,(w,a)  —  i_„/2  yfsHn)ln. 

and  let 

/-.v  f  ,  S.  i(/t,a) 

(3)  N,  ,(>,<*)  -  min  In:  n  >  n0,  s2(n )  >  0,  —  <  y 

(  MT(n)| 

(Note  that  jV,  , (-y .  a),  which  is  the  required  number  of  replications,  is  a  r.v.).  Then  use 

(4)  I,  \(y.a)  =  [T(iV,  ,(-y,a))  -  8ri(Nr\{y,a),a),X(Nr\(y,a))  +  8,  \(Nr |(y,o),a)) 

as  an  approximate  100(l-a)%  c.i.  for  n.  It  easily  follows  from  (3)  and  (4)  that  lr  i(y,a) 
satisfies  the  criterion  given  by  (2).  Furthermore,  using  an  argument  similar  to  the  one 
employed  by  Lavenberg  and  Sauer  in  the  context  of  the  regenerative  method  for  steady-state 
simulations,  we  have  been  able  to  prove  the  following  theorem. 

THEOREM  1.  If n  ^  0  and  0  <  a2  <  °°,  then  lim  P(/jl  €  lr  |(y,a))  =  1  —  a. 

>-o+ 

The  objective  of  the  second  procedure,  which  is  due  to  Nadas  (11],  is  to  construct  a  c.i. 
such  that 


|A,(n)-/i.|  <  y  ImI  for  0  <  y  <  1. 


v2(n)  =  ]l  +  £  \Xj  —  A'(«)]2|/«  =  A/n)  +  (n  —  l)s2(w)/n, 


8r  2(n.a)  =  t„-\.\-a/2  y/v2(n)/n. 


Then  use 


jVr  2 ("y • « )  =  min  \n:n  ^  n0,  -f= 


8, 2(w,a) 


(6)  ,  (ya)mm  X(Nr,2(y.a))  XWr_2(y,a)) 

' 2Ky’  ’  l+y  ’  1  —  y 

as  an  approximate  100(l-a)%  c.i.  for^t.  From  (6)  it  is  easy  to  show  that  lr2(y,a)  satisfies  the 
criterion  given  by  (5).  Furthermore,  the  following  theorem  was  proved  by  Nadas. 


THEOREM  2.  If  pt  ^  0  and  0  <  a-2  <  °°,  then  lim  P{u  €  lr  2(y,a))  =  1  —  a. 

y-0+ 


In  order  to  compare  the  two  procedures  and  to  determine  the  effect  of  non-infinitesimal  y 
on  coverage,  we  once  again  simulated  the  three  stochastic  models.  For  each  model  we  per¬ 
formed  500  independent  experiments,  for  the  M/M/1  queue  and  the  reliability  model  we  con¬ 
sidered  y  =  .2,  .1,  .05  for  each  experiment,  and  for  the  inventory  system  we  considered  y  = 
.2,  .1,  .05,  .025,  .0125,  .00625  for  each  experiment.  In  all  cases,  n0  »  5.  In  Tables  5,  6,  and  7 
we  give  point  estimates  and  90%  c.i.'s  for  the  true  coverages,  point  estimates  and  90%  c.i.'s  for 
£{/V,,(y,a))  (/  -  1,  2),  and  the  average  c.i.  half  lengths  over  the  500  experiments.  We  con¬ 
sidered  more  values  of  y  for  the  inventory  system  because  it  appeared  from  our  empirical 
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TABLE  5.  Relative  Width  Results  for  d(25\N(0)  =  0)  =  2.12, 
M/M/1  Queue  with  p  —  .  9 


Procedure  1 

Procedure  2 

y 

E[NrJ(y.a)) 

coverage 

average  c.i. 
half  length 

£{/V,2(y,a)| 

coverage 

average  c.i. 
half  length 

.2 

42.3  ±  0.9 

.842  ±  .027 

mm 

41.9  ±  0.8 

.862  ±  .025 

.437 

.1 

175.2  ±  1.7 

.860  ±  .026 

Mm 

174.5  ±  1.7 

.868  ±  .025 

.213 

.05 

704.4  ±  3.5 

.884  ±  .024 

.106 

703.7  ±  3.5 

.882  ±  .024 

.106 

TABLE  6.  Relative  Width  Results  for  e(12\  I  \  =  S)  =  99.52, 
(s,  S )  Inventory  System 


Procedure  1 

Procedure  2 

y 

coverage 

average  c.i. 
half  length 

£{Ar  2(y,a)} 

coverage 

average  c.i. 
half  length 

.2 

5.0  ±  0.0 

.902  ±  .022 

4.89 

5.0  ±  0.0 

1.0 

20.74 

.1 

5.0  ±  0.0 

.902  ±  .022 

4.86 

5.0  ±  0.0 

1.0 

10.06 

.05 

5.9  ±  0.1 

.892  ±  .023 

3.97 

5.7  ±  0.1 

.962  ±  .014 

4.99 

.025 

13.3  ±  0.4 

.834  ±  .027 

2.35 

12.3  ±  0.4 

.858  ±  .026 

2.48 

.0125 

51.0  ±  1.0 

.856  ±  .026 

1.23 

49.8  ±  1.0 

.862  ±  .025 

1.24 

.00625 

206.3  ±  1.8 

.872  ±  .025 

0.62 

205.4  ±  1.8 

.876  ±  .024 

0.62 

TABLE  7.  Relative  Width  Results  for  E(T\a!l  components  new)  =  .  778, 

Reliability  Model 


Procedure  1 

Procedure  2 

y 

E{Nr]  (y.a)} 

coverage 

average  c.i. 
half  length 

£{/V,2(y,a)} 

coverage 

average  c.i. 
half  length 

D 

213.7  ±  4.5 

.876  ±  .024 

.152 

214.1  ±  4.5 

.908  ±  .021 

mm 

,i 

907.4  ±11.2 

.898  ±  .022 

.077 

908.6  ±  10.8 

.902  ±  .022 

BH 

.05 

3720.5  ±  23.7 

.882  ±  .024 

.039 

3720.0  ±  23.7 

.884  ±  .024 

results  that  a  smaller  y  is  required  for  the  coverage  ultimately  to  converge  to  the  desired  level. 
(A  smaller  y  is  required  for  this  model  to  get  a  large  value  of  Nri(y,a).)  Note  also  that  con¬ 
vergence  of  coverage  does  not  appear  to  be  monotone. 

We  repeated  the  above  500  experiments  using  the  same  random  numbers  and  n0  =  2. 
For  procedure  2  the  results  were  identical;  however,  for  procedure  1  there  was  a  significant 
degradation  in  coverage  due  to  premature  stopping  on  replications  2,  3,  or  4.  For  example,  the 
coverage  for  the  M/M/1  queue  with  y  =  .2  was  .798. 

C.  Absolute  Width  Procedures 

In  this  subsection  we  present  two  procedures  which  allow  one  to  construct  a  1 00 ( 1  — « ) % 
c.i.  for  p.  such  that 

(7)  \X(n)-p\  <  c, 

where  c  is  a  specified  positive  number. 
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The  first  procedure,  which  is  due  to  Chow  and  Robbins  [1],  assumes  that  X\,  X2,  . 
sequence  of  i.i.d.  r.v's.  Choose  n0  ^  2.  Let  v2(n)  be  defined  as  in  Subsection  4.B,  let 


. .  is  a 


Na  |(c,  a)  =  min 


n:n  >  n0,  v2(n)  < 


c2n 


and  then  use 


(l/t—l, 1— a/2' 

4i(c,  a)  =  [X(Na  l(c,  a))  -  c,  X(N^(c,  «))  +  c] 


as  an  approximate  100(l-a)%  c.i.  for/x.  It  is  clear  that  /a  ,(f,a)  satisfies  the  criterion  given  by 
(7).  The  following  theorem  was  proved  by  Chow  and  Robbins. 

THEOREM  3.  If  0  <  o-2  <  »,  then  lim  P{/x  €  la  ](c,a)}  =  1  —  a.  For  an  empirical 

r— O'* 

evaluation  of  the  above  procedure  under  the  assumption  that  the  A,’s  are  normal,  see  Starr 

[121. 


The  second  procedure,  which  is  due  to  Dudewicz  [2],  assumes  that  the  X,'s  are  i.i.d  nor¬ 
mal  r.v.’s.  Initially  make  n0(n0  >  2)  replications  of  the  simulation  and  compute  X(n0)  and 
s2(n0).  Let 

N„.2 (c,a)  =  max{n0+  1,  [wV(n0)]}, 


where  w  =  ^0_u-a/2/c  and  fzl  is  the  smallest  integer  >  z.  Make  Na2(c,a)  -  «0  additional 
replications  of  the  simulation,  let 

Na  2(c'o) 

Y(Na  2(c,a)  -  n0)  =  £  XJ (Ma  2(c,a)  -  n0), 

i-n  o+l 

and  let  X(Na  2(c,a))  =  a^X(n0)  +  a2Y(Na  2(c,a)  -  w0),  where 


"o 

Na  2(c,a) 


it+y  i- 

A/a  jfc.a)  -  n0  J 

wJs2(w0)  J 

and  a2  =  1  -  at.  Then  use 

la  2(c,a)  =  [X(Na,2(c,a))  -  c,  X(\a2(c,a))  +  c] 
as  an  approximate  100(1— «)%  c.i.  for/x.  Dudewicz  has  proved  the  following  theorem. 

THEOREM  4.  P[/i  e  /,  2(c,a)}  =  1  —  a  for  all  c  >  0. 

To  compare  the  sequential  procedure  of  Chow  and  Robbins  and  the  two-stage  procedure 
of  Dudewicz,  we  performed  500  independent  experiments  for  each  model.  To  make  the  abso¬ 
lute  width  results  somewhat  comparable  to  the  relative  width  results,  we  chose  the  values  of  c 
to  correspond  to  the  values  of  y;  that  is,  for  each  y  we  chose  c  -  y/x.  For  the  Chow  and  Rob¬ 
bins  procedure  we  chose  n0  —  5  and  for  the  Dudewicz  procedure  we  considered  n0  —  15,  30, 
and  60.  (Dudewicz  [3]  recommended  that  n0  be  at  least  12.)  The  results  of  the  simulation 
experiments  for  the  three  models  are  given  in  Tables  8,  9,  10. 
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TABLE  8.  Absolute  Width  Results  for  d(25\  N(0)  —  U)  —  2.12, 
Ml  MU  Queue  with  p  —  .  9 


Chow  and 

Robbins 

Dudewicz 

c 

(c.a)) 

coverage 

"0 

coverage 

15 

49.9  ±  2.1 

.850 

± 

.026 

.425 

38.0 

±  1.2 

.800  ± 

.029 

30 

48.2  ±  1.3 

.912 

± 

.020 

60 

62.1  ±  0.4 

.926 

± 

.019 

15 

196.9  ±  8.5 

.854 

± 

.026 

.212 

173.5 

±  2.5 

.898  ± 

.022 

30 

185.7  ±  5.6 

.888 

± 

.023 

60 

182.9  ±  4.0 

.894 

± 

.023 

15 

786.1  ±  34.2 

.868 

± 

.025 

.106 

706.8 

±  4.8 

.906  ± 

.021 

30 

741.1  ±  22.6 

.878 

± 

.024 

60 

730.2  ±  15.7 

.898 

± 

.022 

TABLE  9.  Absolute  Width  Results  for  e( 1 2 1/,  -  S)  =  99.52. 
(s,.S7  Inventory  System 


Chow  and  Robbins 

Dudewicz 

c 

£( Na.  ,(<•.«)) 

coverage 

"0 

E[NttJc.a)) 

coverage 

15 

16.0  ±  0.0 

.936  ± 

.018 

19.90 

5.0  ±  0.0 

1.0 

30 

31.0  ±  0.0 

.878  ± 

.024 

60 

61.0  ±  0.0 

.888  ± 

.023 

15 

16.0  ±  0.0 

.936  ± 

.018 

9.95 

5.0  ±  0.0 

1.0 

30 

31.0  ±  0.0 

.880  ± 

.024 

60 

61.0  ±  0.0 

.890  ± 

.023 

15 

16.0  ±  0.0 

.922  ± 

.020 

4.98 

5.7  ±  0.1 

.976  ±  .01 1 

30 

31.0  ±  0.0 

.882  ± 

.024 

60 

61.0  ±  0.0 

.886  ± 

.023 

15 

18.5  ±  0.3 

.908  ± 

.021 

2.49 

12.3  ±  0.4 

.880  ±  .024 

30 

31.0  ±  0.0 

.894  ± 

.023 

60 

61.0  ±  0.0 

.882  ± 

.024 

15 

60.8  ±  2.0 

.904  ± 

.022 

1.24 

48.3  ±  1.1 

.872  ±  .025 

30 

55.0  ±  1.3 

.912  ± 

.020 

60 

62.9  ±  0.4 

.902  ± 

.022 

15 

241.7  ±  8.1 

.912  ± 

.020 

0.62 

204.4  ±  1.8 

.896  ±  .022 

30 

217.9  ±  5.1 

.898  ± 

.022 

60 

211.5  ±  3.4 

.912  ± 

.020 

5.  SUMMARY  AND  CONCLUSIONS 

We  have  defined  terminating  and  steady-state  simulations  and  have  discussed  some  com¬ 
mon  measures  of  performance  for  each  type.  In  addition,  we  have  concluded  from  talking  with 
simulation  practitioners  that  a  significant  proportion  of  real-world  simulations  are  of  the  ter¬ 
minating  type.  This  is  fortunate  because  it  means  that  classical  statistical  analysis  for  i.i.d. 
observations  (e.g.,  confidence  intervals,  hypothesis  testing,  ranking  and  selection,  etc.)  is  appli¬ 
cable  to  analyzing  many  simulations.  On  the  other  hand,  in  the  steady-state  case  there  is  still 
not  a  totally  acceptable  procedure  even  for  the  relatively  simple  problem  of  constructing  a  c.i. 
for  a  steady-state  expected  average. 
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TABLE  10.  Absolute  Width  Results  for  E(T\all  components  new)  =  .778, 

Reliability  Model 


Chow  and  Robbins 

Dudewicz 

c 

£{(Va.,(c,a)} 

coverage 

«o 

E{Na.2(c,a) ) 

coverage 

15 

246.0  ±  27.2 

.704  ±  .034 

.156 

179.5  ±  7.0 

.774  ±  .031 

30 

220.8  ±  17.0 

.772  ±  .031 

.  __ 

60 

231.7  ±  14.9 

.812  ±  .029 

15 

981.8  ±  109.0 

.728  ±  .033 

.078 

888.0  ±  14.5 

.900  ±  .022 

30 

880.6  ±  68.0 

.794  ±  .030 

60 

922.2  ±  59.6 

.838  ±  .027 

15 

3925.6  ±  435.8 

.772  ±  .031 

.039 

3672.1  ±  32.9 

.884  ±  .024 

30 

3520.9  ±  272.0 

.788  ±  .030 

60 

3687.2  ±  238.5 

.832  ±  .028 

We  have  also  considered  procedures  for  constructing  c.i.’s  for  terminating  simulations.  If 
one  is  performing  an  exploratory  experiment  where  precision  of  the  c.i.  may  not  be 
overwhelmingly  important,  then  we  recommend  using  a  fixed  sample  size  procedure.  However, 
if  the  X,'s  are  highly  nonnormal  and  if  the  number  of  replications  n  is  too  small,  then  the  actual 
coverage  of  the  constructed  c.i.  may  be  considerably  lower  than  that  desired  (see  Table  3). 

If  one  wants  a  c.i.  having  half  length  that  is  small  relative  to  the  point  estimate,  then  a 
relative  width  procedure  may  be  used.  We  recommend  using  Procedure  2  (due  to  Nadas)  with 
«o  ^  5.  Procedure  2  appears  to  give  slightly  betteT  coverage;  its  criterion  (see  (5))  is  more 
intuitive  than  the  criterion  of  Procedure  1  (see  (2)),  and  Procedure  2  does  not  seem  subject  to 
premature  stopping  even  for  rt0  =  2.  (On  the  other  hand.  Procedure  1  uses  a  more  intuitive 
expression  to  construct  a  c.i.) 

If  one  wants  a  c.i.  for  which  the  half  length  is  a  specified  number,  then  an  absolute  width 
procedure  may  be  used.  We  recommend  using  the  Chow  and  Robbins  procedure  with  n0  >  5. 
Their  procedure  generally  requires  a  smaller  average  sample  size,  the  variance  of  the  sample 
size  is  smaller,  and  its  coverage  seems  to  be  less  affected  by  departures  from  normality  (see 
Table  10). 

In  general,  we  believe  that  relative  width  procedures  are  more  useful  than  absolute  width 
procedures  due  to  the  difficulty  in  specifying  the  absolute  width  c  for  most  simulation  experi¬ 
ments.  When  using  either  the  Nadas  procedure  or  the  Chow  and  Robbins  procedure,  we 
believe  that  it  is  advisable  to  choose  a  y  or  c  which  will  cause  the  procedure  to  run  until  the 
sample  is  at  least  of  moderate  size,  perhaps,  at  least  30.  (Since  both  procedures  are  based  on 
the  central  limit  theorem,  it  is  unreasonable  to  think  that  they  will  work  well  in  general  for  a 
small  sample  size;  see  the  results  for  y  =  .025  in  Table  6.)  Note  that  precise  c.i.’s  may  be 
unaffordable  in  the  real  world  due  to  the  high  cost  of  making  a  single  replication. 

Our  conclusions  on  the  efficacy  of  the  procedures  are  based  on  only  three  models  and, 
thus,  should  be  considered  tentative.  However,  since  the  performance  of  a  procedure  depends 
only  on  the  distribution  of  an  X,  and  not  on  the  complexity  of  a  model,  we  feel  that  there  is  no 
particular  reason  to  suspect  that  the  results  for  real-world  models  should  differ  significantly 
from  those  for  the  simple  models  presented  here. 
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ABSTRACT 

This  paper  explores  computational  implications  of  allowing  nonlinear  objec¬ 
tive  functions  in  the  transportation  problem.  Two  types  of  nonlineoritics,  in¬ 
cluding  polynomials,  are  studied.  The  choice  of  these  functions  resulted  from 
our  interest  in  models  of  integrated  water  management.  Zangwill's  convex 
simplex  method  and  the  primal  method  of  transportation  problem  form  the 
basis  of  our  algorithm  Innovative  features  of  our  work  arc  compact  storage 
and  efficient  compulation  procedures.  We  study  the  effects  on  computation 
time  of  problem  size;  the  density  of  nonlinear  terms;  the  size  of  tolerances  for 
stopping  rules;  and  rules  for  choice  of  new  variables  to  enier  the  solution.  We 
find  that  problems  up  to  95  x  95  in  size  are  capable  of  reasonably  fast  solution. 

A  particularly  surprising  finding  is  that  one-dimensional  search  for  improving 
solutions  performs  adequately,  at  least  for  the  kinds  of  problems  posed  in  this 
paper.  We  are  encouraged  by  our  results  and  believe  that  models  involving 
nonlinear  objective  functions  may  be  tractable  even  for  relatively  large  prob¬ 
lems,  thus  making  possible  more  accurate  descriptions  of  real  situations. 


I.  INTRODUCTION 

The  recent  advent  of  sophisticated  codes  [3],  [7]  for  solving  the  transportation  problem 
makes  it  possible  to  solve  extremely  large  problems  of  this  type.  This  makes  the  transportation 
model  a  powerful  modeling  device.  In  some  instances,  however,  a  real  situation  demands  aug¬ 
mentation  of  the  transportation  problem.  In  particular,  such  effects  as  economies  or 
diseconomies  of  scale  and  other  cost  interactions  among  variables  cannot  be  modeled  in  the 
transportation  framework.  We  feel  that  the  ability  to  solve  large  transportation  problems  that 
incorporate  nonlinear  cost  functions  will  represent  an  important  addition  to  the  usefulness  of 
transportation  models  in  many  situations.  Our  own  motivation  for  solving  problems  such  as 

*W.  would  like  to  thank  Professors  V.  Srinivasan  of  Stanford  University  and  G.L  Thompson  of  Carnegic-Mcllon 
University  for  making  available  their  code  for  the  Transportation  problem 
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this  stems  from  work  in  water  recycling  models  [4],  As  will  be  seen  later,  we  attempt  to  handle 
two  types  of  nonlinearities  in  the  objective  function.  One  of  these  types  consists  of  polynomials 
and  is  thus  quite  general. 

Our  algorithm  represents  a  combination  of  the  primal  transportation  method  [1]  and  the 
convex  simplex  method  of  Zangwill  [8].  In  developing  a  computational  procedure  we  use  as 
the  basic  building  block  the  methods  of  Srinrvasan  and  Thompson  [6],  (7],  Where  possible  we 
use  their  efficient  coding  procedures  for  the  linear  problem.  Our  innovations  include  the  way 
we  store  the  information  required  for  the  nonlinear  objective  function  in  an  efficient  manner, 
and  in  performing  the  nonlinear  calculations  in  ways  we  think  will  minimize  their  impact  on 
total  computation  time.  In  developing  these  techniques  we  have  drawn  from  some  of  the  ideas 
in  Shaftel  and  Thompson  [5].  We  report  computation  times  for  problems  with  up  to  95  rows 
and  95  columns.  Though  computation  times  are  larger  than  those  for  similar  sized  linear  prob¬ 
lems,  they  compare  favorably  with  what  was  considered  acceptable  for  the  linear  transportation 
problem  only  a  few  years  ago.  To  our  knowledge  computational  experience  on  transportation 
problems  with  general  nonlinear  objective  functions  is  not  heretofore  available.  In  this  light 
our  study  is  significant.  We  believe  that  it  will  prove  useful  to  those  engaged  in  developing 
models  with  a  view  to  computational  viability. 

We  report  on  computation  times  for  some  of  the  subroutines  used  in  our  code  in  addition 
to  overall  solution  time  so  as  to  isolate  areas  for  further  improvements.  We  will  also  discuss 
our  experience  in  using  different  parameters,  such  as  stopping  rule  tolerance,  in  the  code.  We 
believe  that  many  improvements  to  this  code  can  be  made,  and  that  even  faster  times  than 
those  reported  here  are  possible. 

2.  PROBLEM  STATEMENT 


The  problem  that  we  are  interested  in  can  be  stated  as  follows:  find  a  schedule  of  ship¬ 
ments  or  flows  of  a  homogeneous  good  from  a  set  of  m  source  nodes  (often  thought  of  as 
warehouses)  to  a  set  of  n  sink  nodes  (often  thought  of  as  markets)  to  minimize  the  total  cost  of 
effecting  the  shipments.  We  denote  x„  to  be  the  flow  from  /  to  j,  and  a,  and  6,  to  be  the  avail¬ 
ability  and  demand  at  nodes  /  and  j,  respectively.  The  objective  function  to  be  minimized  con¬ 
sists  of  a  linear  component  c„,  the  cost  of  unit  shipment  from  /  to  j,  and  some  other  costs  that 
are  nonlinear  in  the  decision  variables  x,j.  We  note  that  the  usual  transportation  problem  has 
only  linear  costs.  See  Dantzig  [2],  for  example.  Mathematically,  we  can  write  our  problem  as: 

Minimize 


m  n 

(1)  Y>'TtciJxiJ  Linear  Terms 

>-i  /-i 

+  i4nn  (x,j) Polynomial  Terms 

/-i  /-i  /—i 

(a  total  of  T  terms) 


+  I A 


/-I 


n 

L  ku  xu 

7-1 


Demand  Interdependencies 


+  £  sj  L  'v x 


Supply  Interdependencies. 
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Subject  to 

4 

(2)  X  x,j  =  a,  for  /  =  1,  2 . m 

j-i 

(3)  X  x'i  =  bi  for  y  =  1,  2 . n 

/-i 

(4)  x „  >  0  for  all  1  and  j. 

We  assume  that  X  a,  “  X  V  ,n  addition  we  require  the  following: 

/  j 

a,  ^  0  for  all  / 

/3,  >  0  for  all  j 

k,j,  I,,  ^  0  for  all  i  and  j 

h,ij  >  0  for  all  t,  i  and  j. 

Actually  the  nonnegativity  of  k,,  and  /„  may  be  relaxed  if  a,  and  /3,  are  restricted  to  integers. 
This  would  be  the  case  if  we  are  dealing  with  quadratics,  for  example.  For  convenience  we 
introduce  the  following  additional  notation: 

R,  =  X  k'J  xu  for  /  =  1,  2 . m 

i~\ 

m 

C,=  X  l0  X'J  for  7  =  1.2 . n 

/-I 

Although  our  purpose  in  this  paper  is  to  examine  the  computational  implications  of  allow¬ 
ing  nonlinear  objective  functions  such  as  (1)  in  the  framework  of  a  transportation  problem,  it  is 
useful  to  motivate  the  choice  of  such  a  function.  As  noted  earlier,  we  first  encountered  the 
need  for  it  in  modeling  integrated  water  management.  The  polynomials  which  we  have  incor¬ 
porated  reflect  the  fact  that  the  use  of  some  arcs  (shipment  routes)  affects  the  cost  of  use  of 
others.  The  transshipment  type  of  model  used  in  water  management  makes  this  a  naturally 
encountered  type  of  objective  function.  The  supply  interdependencies  involve  functions  of  C,, 
which  is  a  linear  combination  of  the  shipments  into  node  j.  As  an  illustration  these  can  be  used 
to  model  a  demand  curve  at  each  of  the  markets.  The  demand  interdependencies  involve  func¬ 
tions  of  R„  which  is  a  linear  combination  of  the  shipments  emanating  from  source  To  illus¬ 
trate  the  use  of  this,  suppose  sink  node  n  is  a  "dummy"  market.  Shipments  from  node  i  to  j, 
j  -  1,2 . n  -1,  can  be  thought  of  as  the  total  production  at  node  i.  The  demand  inter¬ 

dependencies  term  can  then  be  used  to  model  the  total  cost  of  production  at  node  /'. 

We  now  proceed  to  discuss  the  different  steps  in  the  algorithm  and  some  features  of  the 
computer  code  we  have  written  for  the  algorithm. 

3.  DESCRIPTION  OF  THE  ALGORITHM  AND  THE  COMPUTER  CODE 

The  algorithm  for  solving  the  proposed  problem  involves  a  marriage  of  Zangwill's  convex 
simplex  method  (8]  and  the  primal  transportation  method  (1).  We  specialize  this  algorithm  to 
the  transportation  problem.  Our  contribution  is  in  developing  procedures  that  render  the  algo¬ 
rithm  computationally  viable  both  in  respect  of  demands  on  storage  as  well  as  computation 
time.  We  will  therefore  discuss  the  details  of  our  code  at  length,  but  before  that  we  briefly  out¬ 
line  the  various  steps  of  the  convex  simplex  algorithm. 
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The  convex  simplex  method  starts  with  an  initial  basic  feasible  solution  which  is  obtained 
as  for  a  linear  problem.  Let  fix)  be  the  objective  function  to  be  minimized  over  x  in  some 
convex  polyhedral  set.  A  straight  forward  way  to  understand  the  nonlinear  approach  would  be 
to  view  the  objective  function  coefficient  vector  c  of  the  linear  problem  (clearly  V/(x)  -  c)  as 
being  replaced  by  the  gradient  of  the  objective  function  V/Oc)  at  any  iteration  of  the  simplex 
tableau.  If  this  is  done  then  the  reduced  cost  calculation  yields  the  rate  of  change  in  the  objec¬ 
tive  function  with  respect  to  x.  A  variable  which  has  a  negative  reduced  cost  may  be  raised 
causing  the  objective  function  to  decrease. 

In  a  linear  problem,  the  identification  of  a  candidate  variable  is  followed  by  a  pivot  to  a 
new  basis.  For  a  nonlinear  objective  function,  the  optimum  does  not  necessarily  lie  on  an 
extreme  point  of  the  feasible  set.  Thus  the  candidate  variable  provides  a  locally  useful  direction 
along  which  we  search  for  a  step  length.  Having  found  the  extent  of  a  desirable  movement,  the 
candidate  variable  is  raised  to  the  appropriate  value,  and  a  new  solution  point  has  been  found. 
In  the  nonlinear  problem  V/(x)  must  be  calculated  at  each  new  solution  point.  In  addition  to 
increasing  (raising)  variables  with  negative  reduced  costs,  nonzero  (nonbasic)  variables  with 
positive  reduced  costs  may  be  decreased  in  value  in  order  to  reduce  the  objective  function.  At 
any  iteration,  a  new  solution  point  may  be  the  result  of  a  basis  change  or  a  modification  of  the 
values  of  the  basic  variables  without  a  change  of  basis.  The  convex  simplex  method  then,  can 
be  viewed  as  the  following: 

At  any  iteration, 

(i)  Calculate  V/(x).  Using  these,  calculate  the  reduced  costs  associated  with  the  non¬ 
basic  variables. 

(ii)  Find  either  a  raised  (nonzero)  nonbasic  variable  which  does  not  have  a  zero  reduced 
cost  or  a  nonbasic  variable  at  zero  with  a  negative  reduced  cost.  If  none  stop,  the 
optimum  is  obtained. 

(iii)  Raise  or  lower  the  value  of  the  chosen  variable  in  the  direction  which  reduces  the 
objective  function  Find  out  how  far  this  variable  should  be  adjusted  using  a  one 
dimensional  unconstrained  search.  (Of  course,  the  variable  is  restricted  to  be  nonne¬ 
gative).  If  a  basic  variable  goes  to  zero  prior  to  reaching  the  unconstrained  minimum 
go  to  (iv).  Otherwise  go  to  (v). 

(iv)  The  candidate  variable  enters  the  basis,  and  the  basic  variable  driven  to  zero  is 
removed  from  the  basis.  Go  to  (i). 

(v)  Modify  the  values  of  the  basic  variables  and  the  nonbasic  variable  being  raised  or 
lowered.  The  basis  remains  the  same.  Go  to  (i). 


The  stopping  rule  in  (ii)  will  lead  to  a  global  minimum  if  the  objective  function  is  convex. 
Otherwise,  a  local  minimum  or  a  stationary  point  is  arrived  at.  In  effect,  a  Kuhn-Tucker  point 
is  obtained.  Although  we  will  refer  to  the  optimum  solution  in  the  remainder  of  the  paper,  the 
above  qualifications  are  to  be  borne  in  mind. 

An  important  aspect  of  the  work  we  have  accomplished  is  the  modification  of  the  primal 
transportation  solution  technique  to  solve  a  family  of  nonlinear  problems.  We  now  discuss  the 
computational  procedures  in  detail. 
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The  discussion  in  this  section  will  make  use  of  Figure  1  to  a  great  extent.  This  figure 
develops  a  flow  chart  which  shows  the  modifications  necessary  for  accommodating  nonlinear 
objective  functions  in  the  transportation  problem.  The  figure  is  divided  into  three  parts  each 
containing  different  blocks  of  the  flow  chart.  The  first  part  represents  calculations  for  the  linear 
problem  which  are  not  used  in  the  nonlinear  case.  The  second  third  of  the  figure  represents 
those  blocks  which  would  be  used  by  both  linear  and  nonlinear  problems.  The  first  two  parts 
together  represent  a  flow  chart  for  the  linear  transportation  problem.  The  last  part  of  the  figure 
shows  the  added  computation  necessary  for  nonlinear  problems.  Combining  the  last  two  parts 
of  Figure  1  yields  the  flow  chart  for  solving  the  nonlinear  transportation  problem. 

The  flow  chart  in  Figure  1  closely  follows  the  FORTRAN  computer  code  which  we  have 
written.  Thus,  each  section  below  from  3.1  to  3.9  represents  a  subroutine  in  our  code.  Wher¬ 
ever  appropriate,  the  block  number  from  Figure  1  will  be  indicated  in  the  body  of  the  text. 

3.1  Input 

The  input  for  the  problem  consists  of  (a)  the  linear  cost  coefficients,  (b)  the  rim  condi¬ 
tions,  i.e.,  capacities  of  sources  (plants)  and  demands  at  sinks  (markets)  (block  A.l)  and  (c) 
the  nonlinear  part  of  the  objective  function  (block  B.l).  The  linear  cost  coefficients  and  rim 
conditions  for  a  problem  of  size  m  x  n  are  stored  in  a  (m  +  1)  x  («  +  1)  matrix  in  the  usual 
way.  The  storage  of  the  nonlinear  part  however  needs  more  elaboration.  We  will  address  the 
storage  of  polynomial  terms  (type  1)  and  demand  and  supply  interdependencies  (type  2)  in  the 
objective  function  separately. 

3.1.1  Type  1  —  Objective  Function 

For  each  term  /,  in  this  part  of  the  objective  function  we  must  keep  the  following  pieces 
of  information:  (i)  d,  the  constant  and  (ii)  the  exponent  of  each  variable  x„  in  term  t.  If  a 
particular  variable  does  not  occur  in  a  term,  clearly  the  corresponding  exponent  would  be  zero. 
Since  in  most  problems  not  all  variables  will  occur  in  each  term  with  a  nonzero  exponent,  we 
can  take  advantage  of  this  to  significantly  reduce  the  storage  requirements  for  this  part  of  the 
objective  function.  To  do  this,  we  assign  an  index,  v,  to  variable  Jr,,,  where 
v  “  n  x  (/  —  1)  +  j.  Then,  we  create  a  list,  denoted  HL,  that  contains  the  term  in  which  a 
variable  occurs  and  its  nonzero  exponent.  If  it  does  not  occur  in  a  term,  it  is  not  stored.  The 
HL  list  is  ordered  by  the  index  v.  We  then  use  a  pointer  list,  HTL,  in  order  to  extract  informa¬ 
tion  about  each  variable  as  it  is  needed.  In  order  to  accomplish  this  information  extraction,  the 
HTL  list  keeps  the  beginning  and  ending  positions  for  information  regarding  each  variable.  In 
this  fashion  we  need  only  store  the  term  numbers  in  which  a  variable  occurs  with  a  nonzero 
exponent,  the  value  of  that  exponent,  and  a  pointer  list  of  length  m  x  n.  This  as  opposed  to  an 
m  x  n  matrix  for  each  term  of  the  polynomial. 

3.1.2  Type  2  —  Objective  Function 

For  each  row  nonlinear  term,  i  -  1,  ....  m,  the  coefficients  ktJ,  j  =  1 . n  and  the 

coefficients  a,  are  stored.  The  data  for  the  m  rows  along  with  the  exponent  are  stored  in  a 

m  x  (n  +  1)  matrix.  The  data  for  the  column  nonlinear  terms  /y,  i  =  1 . m , 

j  ”  1,  ....  n.  and/3,,  are  similarly  stored  in  a  (m  +  1)  x  n  matrix. 

3.2  INSOL 

This  step  finds  an  initial  feasible  solution  (Block  A. 2).  We  do  this  by  using  the  linear  cost 
coefficients  only.  An  initial  solution  is  found  using  the  Modified  Row  Minimum  Rule  of 
Srinivasan  and  Thompson  [7], 
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Linui  Problem 
Only 


#oth  Linear  end 
Nonlinear  Problem* 


Nonlinear  Problem 

0"'y 


Input  nonlinear  Irmi 


The  figure  shows  necessary  modifications  to  a  linear  transporta¬ 
tion  problem  in  order  to  accommodate  non-linear  objective  Junc¬ 
tions.  Dashed  lines  are  unique  to  nonlinear  problem.  Crossed 
lines  are  unique  to  linear  problem.  Solid  lines  are  mutual. 
Thick-lined  boxes  must  be  modified  for  different  types  of  non¬ 
linear  objective  functions. 


Modify  be**  variable 


Calculate  obtectnre  function  value 
polynomial  term  value*  and  row 
end  column  turn  value* 


Calculate  linear  cott  equivalent  uting 
partial  derivative*  of  obtective  function 
fot  each  ba»*c  variable 


Calculate  linear  com  equivalent  utmg 
partial  derivativei  of  obtective  function 
for  each  no"  and  railed  non  banc 
variable 


a  7 

Seeich  toi  amount  o' 

change  tor  vat  table 

— 

being  modifier! 

Calculate  obtectivi 
function  value  lot 
point!  of  weft 


Update  nonbet*  li»t  if  I 
necenary  and  veHie  of  [—  — 
h««rr  variable!  | 


Figure  1.  Flow  chart  of  computer  code 
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3.3  LABEL  1 

This  step  creates  the  basis  list  in  the  manner  of  a  "pointed  list"  (Block  A. 3).  The  basis  list 
shows  the  information  on  current  basic  cells,  i.e.,  their  row  /,  column  j  and  current  value.  See 
Srinivasan  and  Thompson  [7]  for  more  details. 

3.4  OBJFUN  2 

This  step  calculates  the  value  of  the  nonlinear  part  of  the  objective  function  for  each  of 
the  two  nonlinear  functions  (Block  B.2).  For  the  polynomials  each  term  value  is  calculated 
separately.  These  values  will  be  needed  later  in  the  program  for  finding  the  partial  derivatives 
of  the  objective  function  with  respect  to  each  of  the  variables.  Actually  two  vectors  store  these 
term  values  for  the  polynomial  objective  function.  The  first  vector,  Z,  stores  the  term  values 
which  when  added  will  give  the  current  value  of  the  nonlinear  objective  function.  The  second 
vector,  Zl,  stores  the  term  values  in  a  form  that  enables  the  calculation  of  partial  derivatives. 
This  calculation  will  be  discussed  in  the  next  section.  The  objective  function  value  for  row  and 
column  nonlinear  terms  are  calculated  in  two  steps.  First  the  row  terms  £ k„  xn  for 

'  -  1 . m  and  the  column  terms  £  /,,  x„  for  j  -  1 . n  are  calculated  and  stored  in 

/ 

vectors  RSUM  and  CSUM.  Then  these  sums  are  raised  to  the  appropriate  exponent  values  and 
stored  in  RTERM  and  CTERM.  Once  again  these  stored  values  will  be  extremely  useful  for 
calculation  of  partial  derivatives  discussed  in  the  next  section. 

3.5  LABEL  2 

This  subroutine  of  the  code  accomplishes  three  things.  First,  it  calculates  the  partial 
derivatives  of  the  objective  function  with  respect  to  each  of  the  basic  variables  (Block  B.3). 
Second,  it  labels  the  basis  tree  (Block  A. 4)  using  the  predecessor  successor  index  method  of 
Srinivasan  and  Thompson  [7],  This  tree  is  used  to  determine  the  cycle  created  by  the  addition 
of  a  new  cell  to  the  solution,  i.e.,  a  new  variable  entering  the  basis.  Finally,  it  calculates  the 
current  value  of  the  dual  variables  (Block  A. 4).  In  the  linear  primal  transportation  code  only  a 
part  of  the  tree  basis  must  be  relabled  and  only  a  subset  of  the  dual  variables  needs  to  be 
revised.  For  the  nonlinear  problem,  however,  we  must  use  completely  revised  partial  deriva¬ 
tives  at  each  iteration  in  order  to  calculate  the  duals— in  doing  so  we  also  relabel  the  entire  tree. 

We  now  turn  to  the  technique  we  use  for  calculating  the  partial  derivatives.  We  attempt 
to  determine  the  partial  derivatives  in  an  efficient  way.  For  each  variable,  we  calculate  the 
derivative  with  respect  to  the  linear,  polynomial  and  row  and  column  sum  terms  separately  and 
then  add  them  together.  The  derivatives  associated  with  the  linear  costs  are  of  course  merely 
the  initial  coefficients.  The  derivatives  for  the  two  types  of  nonlinear  functions  are  then  calcu¬ 
lated  as  follows: 


3.5.1  Derivatives  for  Polynomial  Terms 


We  would  like  to  avoid  recalculating  each  polynomial  term  for  each  variable  whose  partial 
derivative  is  calculated.  To  do  this  we  take  advantage  of  the  fact  that  at  each  iteration  the  value 
of  each  polynomial  term  is  known  to  be  some  value  Zl„  and  that  taking  the  derivative  with 


respect  to  a  variable  xu  will  yield  the  result. 


"m 


Zl, 


.  This  provides  us  with  a  very  fast  tech¬ 


nique  for  finding  the  derivative  of  each  term  since  we  perform  just  one  multiplication  and  divi¬ 
sion. 
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By  using  the  HL  and  HTL  vectors  we  can  quickly  find  only  those  polynomial  terms  which 
contain  x,t.  In  the  use  of  this  technique,  problems  arise  whenever  the  value  of  x„  is  zero  and 
the  exponent  hnl  is  less  than  or  equal  to  one.  When  the  exponent  is  greater  than  one  or  when 
more  than  one  zero  valued  variable  occurs  in  a  term  the  partials  are  zero  and  offer  no  computa¬ 
tional  difficulty.  In  the  case  where  a  single  x(/  with  an  exponent  less  than  or  equal  to  one  is  at 
the  value  zero  we  act  as  though  this  variable  is  bounded  to  a  prespecified  small  value  (EPS1) 
until  the  partials  are  found.  It  is  then  returned  to  its  original  value.  (The  storage  of  the  extra 
vector  Z  allows  us  to  perform  this  modification  without  losing  the  true  value  of  the  polynomial 
terms.) 

3.5.2  Derivatives  for  Row  and  Column  Nonlinear  Terms 


The  derivatives  for  these  terms  are  found  in  a  similar  fashion  to  the  polynomial  terms. 
Since  the  row  and  column  sums  are  known,  the  derivative  with  respect  to  any  variable  x„  will 
k  '  a  *  RTF.R  N/f  J 

be  — — 77. 777 - '  and  similarly  for  the  column  terms.  (RTERM  and  RSUM  were  calcu- 

KSUM 

lated  and  stored  earlier  in  OBJFUN  2.)  In  this  case,  if  RSUM  is  zero  the  derivative  is  also 
zero. 


3.6  MAIN 

This  subroutine  of  the  code  first  calculates  partial  derivatives  with  respect  to  the  nonbasic 
variables  (Block  B.4)  in  the  same  fashion  as  described  in  the  last  section  for  basic  variables,  in 
fact  there  will  be  two  types  of  nonbasic  variables,  (i)  Raised  nonbasic  variables  whose  values 
are  not  zero  that  will  be  stored  in  an  array  similar  to  that  for  basic  variables,  and  (ii)  nonbasic 
variables  that  are  at  the  value  zero  (zero  nonbasic  variables).  Once  the  partial  derivative  of  a 
nonbasic  variable  is  known,  its  reduced  cost  may  be  found  using  the  standard  linear  transporta¬ 
tion  dual  equations  and  the  dual  variables  calculated  in  LABEL  2  above  (Block  A. 5).  We  now 
attempt  to  find  a  raised  nonbasic  variable  to  be  raised  or  lowered  (Block  B.5)  or  a  zero  nonbasic 
variable  to  be  raised  (Block  A. 6).  Any  nonbasic  variable  with  a  negative  reduced  cost  is  a  can¬ 
didate  for  being  raised  above  its  current  value.  A  raised,  nonbasic  variable  with  a  positive 
reduced  cost  is  also  a  candidate  for  being  modified.  In  this  case  the  variable  must  be  reduced 
below  its  current  value  in  order  to  improve  the  objective  function.  Of  these  candidates  we 
choose  a  cell  according  to  one  of  the  following  rules: 

(i)  Matrix-minimum:  here  the  cell  that  has  the  smallest  reduced  cost  among  all  cells  is 
chosen, 

(ii)  Row-minimum,  here  cells  are  examined  row  by  row.  As  soon  as  a  candidate  is 
encountered,  the  cell  with  the  lowest  reduced  cost  in  that  row  is  chosen, 

(iii)  Lot-minimum:  here  cells  are  examined  in  specified  lots  of  NP  variables.  As  soon  as 
a  lot  contains  a  candidate  cell,  the  cell  with  the  lowest  reduced  cost  in  that  lot  is 
chosen.  Of  course  the  row-minimum  rule  is  equivalent  to  the  lot  minimum  rule  with 
NP  =  n  where  n  is  the  number  of  columns.  The  lowest  reduced  cost  should  be 
interpreted  as  largest  in  absolute  value. 

More  will  be  said  later  about  the  effects  of  the  choice  of  rule  for  picking  candidate  cells. 


3.7  CYCLE 


In  this  step  we  find  the  cycle  that  the  candidate  cell  forms  with  the  current  basis  using 
Srinivasan’s  and  Thompson's  procedure  (Block  A. 7).  We  isolate  the  cells  whose  shipments  will 
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be  altered  by  modifying  the  candidate  cell.  We  also  find  the  maximum  extent  that  the  candi¬ 
date  cell  can  be  modified.  This  provides  an  upper  bound  for  a  cell  if  it  is  to  be  raised,  or  a 
lower  bound  if  it  is  to  be  lowered  (Block  B.6).  Of  course  in  the  linear  transportation  problem 
there  is  always  a  pivot  to  the  new  basis  which  permits  one  to  alter  shipments  at  this  stage.  In 
the  nonlinear  case  however,  before  we  can  alter  shipments  we  must  determine  the  extent  to 
which  we  wish  to  modify  the  candidate  cell.  We  do  this  in  the  next  step. 

3.8  SEARCH 

In  this  step  we  search  for  a  value  of  the  candidate  variable  that  lies  between  its  current 
value  and  the  bound  obtained  from  the  previous  step.  We  do  this  with  a  one-dimensional  gol¬ 
den  section  search  (Block  B.7).  Although  many  possible  one-dimensional  searches  could  have 
been  chosen,  we  preferred  the  robust  characteristics  of  golden  sections.  For  this  search  we 
have  to  calculate  the  objective  function  for  each  trial  value  of  the  candidate  variable  (Block 
B.8).  This  is  done  in  two  steps.  First  we  calculate  the  value  of  the  objective  function  that  is 
not  affected  by  changes  in  the  shipments  of  the  cells  in  the  cycle.  This  fixed  part  is  calculated 
for  the  linear  and  the  row  and  column  nonlinear  portions  by  setting  all  variables  in  the  cycle  to 
be  zero  and  then  calculating  the  objective  function.  For  the  polynomial  portion  the  same  pro¬ 
cedure  is  used  after  setting  all  variables  in  the  cycle  to  be  one.  Then,  for  each  trial  value  we 
determine  the  shipments  of  the  cells  in  the  cycle  and  augment  the  above  fixed  part  of  the 
objective  function.  This  process  greatly  reduces  the  number  of  computations  needed  for  each 
trial  value.  The  exit  possibilities  from  SEARCH  are: 

(i)  we  pivot  to  a  new  extreme  point, 

(ii)  we  move  to  an  interior  point, 

(iii)  we  are  not  able  to  move  more  than  a  pre-specified  small  value  (EPS2),  i.e.,  the  can¬ 
didate  cell  does  not  lead  to  a  better  solution. 

Actually,  before  we  use  the  method  of  golden  sections  we  check  to  see  if  a  pivot  will 
result  in  a  lower  objective  function.  If  such  is  the  case,  we  pivot  rather  than  look  for  an  inte¬ 
rior  point  with  a  lower  objective.  This  takes  advantage  of  the  speed  of  the  transportation  algo¬ 
rithm,  ignoring  the  difficult  nonlinear  search  as  often  as  possible. 

3.9  UPDATE 

This  step  updates  the  solution.  For  all  exits  for  SEARCH  we  update  the  nonbasic  list  as 
necessary  (Block  B.9).  If  we  pivot  to  a  new  basis  then  we  also  update  the  basis  list  (Block  A. 8). 
We  then  return  to  LABEL  2  and  beg  in  a  new  iteration. 

3.10  Stopping  Rules 

There  are  two  possible  criteria  that  will  terminate  the  algorithm,  (i)  If  no  candidate  vari¬ 
able  with  a  reduced  cost  above  a  certain  prespecified  amount  exists,  the  program  stops  (Block 
S.l).  This  stopping  rule  is  the  normal  rule  for  linear  programs  and  is  invoked  most  frequently 
in  linear  or  nonlinear  problems,  (ii)  When  performing  the  nonlinear  search  (Block  B.7),  it  is 
possible  that  the  candidate  variable  that  is  not  at  its  lower  bound  cannot  be  moved  a  greater 
amount  than  a  prespecified  tolerance  with  a  resulting  improved  objective  value.  When  this 
occurs  the  program  searches  for  another  candidate  cell  among  another  subset  of  NP  variables. 
The  program  stops  whenever  no  candidate  cell  from  these  subsets  leads  to  an  improved  objec¬ 
tive  function  value  (Block  S.2). 
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4.  COMPUTATIONAL  EXPERIENCE 

Computational  experience  is  based  on  random  problems.  Input  for  each  problem 
includes: 

m ,  the  number  or  rows 

n,  the  number  of  columns 

T,  the  desired  number  of  terms  in  the  polynomial 

AXT,  the  average  number  of  variables  per  term 

ADEN,  the  average  number  of  nonzero  row  and  column  coefficients,  k,,  and  I,, 

EPS2,  the  tolerance  for  stopping  rule  (ii) 

EPS3,  the  tolerance  for  stopping  rule  (i)  (set  at  .001  throughout  these  tests) 

NP,  the  number  of  variables  whose  reduced  cost  is  calculated  during  any  pass  of  MAIN. 

With  these  inputs,  test  problems  are  generated  with  the  following  specifications: 

a,,b,  :  random  integer  numbers  between  0  and  100 

c„  :  random  numbers  between  0  and  100  plus  the  row  number  of  that  variable 

(I,  :  random  numbers  between  0  and  30 

h„)  :  random  numbers  between  0  and  3  for  variables  chosen  at  random  to  be  in  each  term 

Si.D^k,,,!,,  :  random  numbers  between  0  and  10 
a,, /3,  :  random  numbers  between  0  and  3 

Figures  2  through  7  summarize  the  results  of  the  computation  runs.  All  numbers  except 
those  with  »  are  based  on  3  runs;  •  indicate  a  single  run  only.  For  each  problem  type  with 
multiple  runs,  the  computation  times  are  the  average  of  total  execution  times  including  time  for 
generating  input  and  are  in  seconds.  The  code  itself  is  written  in  FORTRAN.  All  computation 
was  performed  on  CYBER  70  at  the  University  of  Arizona  computation  center.  Inspection  of 
Figure  7  indicates  that  CPU  time  is  improved  if  only  a  subset  of  reduced  costs  are  calculated. 
Generally  10  to  20%  of  the  variables  seem  about  optimal  in  this  case.  A  number  in  this  range 
was  chosen  for  NP  in  all  the  computational  results  presented.  As  expected,  times  increased 
with  the  finer  tolerance  in  the  search  to  a  point  where  too  fine  a  search  would  not  yield  any 
results  whatever— see  Figure  3.  (These  were  the  only  cases  of  no  consequence.)  Times 
increase  dramatically  as  the  row  and  column  nonlinear  terms  have  more  nonzero  coefficients 
and  thus  become  more  active— see  Figure  4.  As  the  number  of  variables  occurring  in  each 
polynomial  term  increased,  the  computation  time  is  not  greatly  affected.  This  is  seen  in  Figure 

5.  This  situation  is  not  unexpected  since  a  single  zero  variable  in  a  term  gives  it  a  zero  value 
(and  two  zero  variables  make  the  derivative  of  that  term  zero).  Times  decreased  as  the 
number  of  terms  in  the  polynomial  rose— see  Figure  6.  We  suspect  that  the  multiple  random 
terms  in  the  polynomial  somehow  balanced  each  other  so  that  the  cumulative  effect  was  to 
offset  each  other,  thus  leading  to  a  solution  closer  to  an  extreme  point. 

Due  to  the  realities  of  computer  budgets,  more  extensive  tests  were  not  possible.  The 
basic  results,  however,  can  be  found  in  Figure  2.  Total  time  even  for  the  95  x  95  problem  are 
well  within  acceptable  computation  times.  Near  optimal  results  (10%  from  optimum)  were 
obtained  very  quickly.  Also  obvious  is  that  the  bulk  of  the  time  spent  in  solving  the  nonlinear 
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Average 
No.  of  Raised 


Problem 

Size 

Average 

Total 

Time" 

Average 
Time  to  Within 
10%  of  OPT 

Average  Average 
Time  in  Time  in 
Search  Main 

Average 
No.  of  Basic 
Variables 

Non-Basic  Variables 
(%  increase  in  non¬ 
zero  variables) 

10  x  10 

1.52  0(1.02) 

.42 

1.16 

.10 

19 

3.00  (15.7) 

20  x  20 

4.64  0(4.20) 

2.00 

3.31 

.53 

39 

6.33  (16.2) 

30  x  30 

21.02  (18.81) 

10.85 

15.03 

3.00 

59 

16.67  (28.3) 

50  x  50 

56.71  (56.91) 

34.55 

26.74 

25.85 

99 

28.67  (29.0) 

70  x  70* 

198.74  (00.00) 

56.13 

143.24 

41.72 

139 

42.00  (30.2) 

95  x  95* 

354.38  (00.00) 

227.73 

255.19 

76.02 

189 

63.00  (33.3) 

Figure  2.  Results  for  a  Range  of  Problem  Sizes 
EPS2  =  .05;  T  -  20;  A  XT  -  20;  ADEN  =  .9 
NP  =  7  +  20  percent  of  (he  total  number  of  variables 
“Number  in  parenthesis  are  median  times 
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17.07 
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3.533  x  105 

.01 

Solution  not  found 

Figure  3.  EPS2  vs.  Time 

T  =  20,  AXT  =  2,  ADEN 
m  =  n  =  30,  NP  =  150 
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Time 
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.5 

14.68 
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21.30 

Figure  4.  ADEN  vs.  Time 
T  =  20,  EPS2  =  .05,  AXT  =  2, 
m  =  n  =  30,  NP  =  150 
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AXT  Time 


2.5 

20.49 

5.0 

20.89 

20.0 

21.30 

Figure  5.  AXT  vs.  Time 

T  -  20,  ESP2  -  .05,  ADEN 
m  -  n  =  30,  NP  =  150 

T  Time 

10  23.51 

20  20.89 
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Figure  6.  T  vs.  Time 

AXT  -  5.0,  ESP2  =  .05,  ADEN  =  .9, 

m  =  n  -  30,  NP  -  150 
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Figure  7.  NP  vs.  Time 
T  -  20,  AXT  -  20,  ADEN  -'  .9, 
EPS2  —  .05  (—  .10  for  30  x  30  case) 
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problem  is  in  MAIN  where  the  reduced  costs  are  calculated,  and  in  SEARCH.  One  obvious 
area  of  furthe  research  will  be  to  replace  the  relatively  slow  (but  robust)  golden  sections  search 
with  a  faster  type  of  linear  search.  The  final  column  in  figure  2  is  the  number  of  nonbasic  vari¬ 
ables  in  the  optimum  solution  at  a  value  other  than  zero.  We  use  this  as  one  measure  of  the 
nonlinearity  of  the  problems  we  have  chosen  to  solve.  We  also  indicate  the  percent  increase  in 
nonzero  variables  caused  by  raised  nonbasic  variables. 

5,  CONCLUSIONS 

In  this  paper  we  have  shown  that  transportation  problems  with  a  c!  iss  of  nonlinear  objec¬ 
tive  functions  can  be  solved  efficiently  for  problems  large  enough  to  make  them  useful.  We 
believe  in  fact  that  the  results  are  startling.  These  results  are  even  more  impressive  given  the 
common  belief  that  one  variable  at  a  time  searches  cannot  be  useful  in  solving  nonlinear  prob¬ 
lems.  In  the  cases  we  studied,  only  a  small  percentage  of  nonbasic  variables  became  active  so 
that  our  problems  were  in  fact  smallish  nonlinear  problems— nevertheless  there  is  no  way  of 
telling  a  priori  which  variables  will  be  inactive.  Also,  at  the  optimum  the  number  of  raised 
nonbasic  variables  is  significant  in  relation  to  the  number  of  basic  variables. 

Obvious  areas  of  further  research  remain.  In  particular,  replacing  the  slow  golden  section 
search  with  a  more  efficient  routine  will  lead  to  some  improvement.  It  is  also  very  simple  in 
the  present  code  to  modify  the  calculations  to  solve  nonlinear  problems  with  any  differentiable 
objective  function.  Finally,  attempting  to  modify  more  than  one  variable  during  the  nonlinear 
search  presents  an  intriguing  option. 

We  hope  that  this  paper  will  stimulate  others  to  explore  using,  where  appropriate,  more 
extensive  nonlinear  models  in  their  own  problem  solving  studies.  Nonlinear  problems  pose 
interesting  challenges.  Computational  viability  uiltimately  depends  on  exploiting  special  prob¬ 
lem  structures  as  well  as  experimentation  with  computational  procedures.  This  has  been  the 
primary  aim  of  our  research. 
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ABSTRACT 

The  stochastic  vehicle  routing  problem  is  a  problem  of  current  importance 
and  research  interest.  Applications  include  schoolbus  routing,  municipal  waste 
collection,  subscription  bus  scheduling,  daily  delivery  of  dairy  goods,  and  a  host 
of  related  transportation  and  distribution  activities.  In  this  paper,  we  assume 
that  routes  for  vehicles  have  already  been  generated  and  we  focus  on  determin¬ 
ing  operating  strategies.  That  is,  under  what  conditions  should  a  driver  return 
to  the  central  depot  in  order  to  replenish  his  supply?  We  present  a  dynamic 
programming  recursion  which  addresses  this  question  and  we  show  that  the  op¬ 
timal  policy  is  of  a  rather  simple  form.  Finally,  an  algorithm  and  example  illus¬ 
trate  the  policy. 


INTRODUCTION 

The  stochastic  vehicle  routing  problem  is  a  problem  of  current  importance  and  consider¬ 
able  research  interest.  Applications  include  schoolbus  routing,  municipal  waste  collection,  sub¬ 
scription  bus  scheduling,  daily  delivery  of  dairy  goods,  and  a  host  of  related  transportation  and 
distribution  activities.  The  problem  that  has  been  considered  to  date  is  to  determine  a  fixed  set 
of  vehicle  routes  of  minimal  expected  total  distance.  We  assume  that  all  vehicles  leave  from 
and  eventually  return  to  a  central  depot,  and  that  vehicle  capacity  constraints  and  probabilistic 
customer  demands  must  be  satisfied  (for  deterministic  demands  see  Golden,  Magnanti  and 
Nguyen  [1]). 

In  early  work  on  stochastic  vehicle  routing,  Tillman  (4]  introduced  a  heuristic  approach  to 
determine,  under  some  rather  strong  assumptions,  a  fixed  set  of  routes  of  minimal  expected 
total  cost.  The  total  cost  of  traveling  a  route  included  a  cost  associated  with  not  hauling  enough 
goods  to  satisfy  customer  demands  and  a  cost  associated  with  carrying  unnecessary  excess  goods 
on  the  route. 


159 


160 


J.R.  YEE  AND  B  L.  GOLDEN 


Golden  and  Stewart  [2]  address  the  problem  of  determining  a  fixed  set  of  routes  where 
the  demand  of  customer  /'  is  modeled  by  a  Poisson  distribution  with  mean  A,.  The  objective  is 
the  minimization  of  the  expected  total  distance  traveled.  In  Golden  and  Yee  [3],  this  work  is 
extended  and  a  general  framework  for  solving  the  stochastic  vehicle  routing  problem  is  pro¬ 
vided.  The  authors  discuss  a  solution  procedure  for  correlated  demands  and  develop  a  general 
solution  technique  for  the  situation  where  demands  are  Poisson,  negative  binomial,  binomial, 
or  gamma  distributed.  In  addition,  they  derive  analytical  results  which  describe  the  relation¬ 
ships  between  design  parameters  and  offer  a  scheme  for  performing  perturbation  analysis. 

In  previous  work,  whenever  a  vehicle  does  not  have  enough  goods  to  satisfy  a  customer’s 
demand,  it  must  immediately  return  to  the  central  depot  for  reloading  before  supplying  goods 
to  the  remaining  customers  on  the  route.  This,  however,  is  not  always  the  most  cost-effective 
strategy.  It  is  sometimes  judicious  not  to  wait  until  a  vehicle  is  empty  in  order  to  return  to  the 
central  depot  for  reloading.  In  this  paper,  we  determine  conditions  under  which  a  vehicle 
should  return  to  the  central  depot.  These  operating  strategies  lead  to  the  minimization  of 
expected  distance  traveled. 


DISCUSSION 


Suppose  that  the  routes  for  vehicles  have  already  been  determined  by  a  solution  pro¬ 
cedure  such  as  the  Clarke- Wright  algorithm  (see  (1]  for  details).  Each  vehicle  leaves  the  cen¬ 
tral  depot  and  proceeds  to  deliver  goods  to  a  number  of  demand  points.  After  delivering  goods 
to  a  demand  point  on  a  route,  the  driver  is  faced  with  the  decision  of  whether  or  not  to  return 
to  the  depot  in  order  to  replenish  his  supply.  We  shall  prove  that  the  optimal  decision  is  based 
upon  whether  or  not  the  remaining  supply  of  goods  in  the  vehicle  is  greater  or  less  than  some 
critical  value  y  which  must  take  into  account 

(i)  the  probabilistic  demands  on  the  remaining  portion  of  the  route,  and 

(ii)  the  distances  between  customers. 

Let  the  demand  of  customer  /(/  —  1,2 . N)  be  given  by  the  discrete  random  variable 

X;  and  have  some  arbitrary  mass  function  P(Xj  =  k)\  we  remark  that  continuous  probability 
distributions  can  be  handled  analogously.  Each  vehicle  has  a  capacity  of  c  and  starts  out  on  a 
route  with  a  supply  of  c  units.  Without  loss  of  generality,  we  number  the  customers  in  reverse 
order  from  which  deliveries  are  made,  i.e.,  deliveries  are  made  to  customer  N  first,  customer 
N-  1  second,  and  so  on  (customer  0  denotes  the  central  depot).  The  matrix  of  distances 
D  =  [dtJ]  between  customers  i  and  j  is  known  in  advance  and  we  will  assume  that  it  is  sym¬ 
metric  in  order  to  simplify  computations. 


In  general,  after  delivering  goods  to  customer  n  +  1,  let  y„  represent  the  remaining  sup¬ 
ply  of  goods  on  the  vehicle  with  n  customers  still  to  be  visited  and  let  E„(y„)  for 
n  =  0, 1,  ...  ,  A  be  the  expected  distance  to  be  traveled  in  supplying  the  remaining  n  custo¬ 
mers  given  that  there  are  ,y„(0  <  y„  <  c)  remaining  units  of  supply  in  the  vehicle  and  an 
optimal  policy  is  followed;  for  n  =  0,  V„(y„)  is  defined  to  be  d\  0. 


We  now  present  a  dynamic  programming  approacH  for  determining  the  optimal  operating 
policy  for  a  vehicle.  After  delivering  to  customer  n  +  1,  we  have 


(1) 


w 


mm 


4i+I.O  +  d0.„  +  £ 

k- 0 


A)E„_,(r  -  A)\ 
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+  I  m„  =  k)[2d0,n+  K„_,(c-  k  +_y„)+] 

*-o  *->■„+ 1 

where  F(w)+ =  F(max  (O.w)). 

The  recursive  equation  in  (1)  may  be  used  to  determine  the  minimal  expected 

distance  for  completing  the  route  with  y„  remaining  units  of  supply  in  the  vehicle  and  n 
demand  points  still  to  be  serviced.  The  two  options  available  to  the  driver  are  to  go  back  to  the 
depot  and  replenish  his  supply  or  to  proceed  to  the  next  customer  on  the  route.  The  expres¬ 
sion  representing  the  expected  distance  of  first  returning  to  the  depot  includes  the  distance  of 
traveling  from  customer  n  +  1  to  the  depot,  then  from  the  depot  to  customer  n,  and  finally  the 
minimal  expected  distance  with  n  —  1  customers  remaining.  On  the  other  hand,  if  the  driver 
advances  to  the  next  node  directly,  the  expected  distance  includes  the  distance  between  custo¬ 
mers  n  +  1  and  n,  the  distance  from  customer  n  to  the  central  depot  which  might  be  incurred 
twice  as  a  penalty,  and  the  minimal  expected  distance  with  n  -  1  customers  remaining.  It  is 
assumed  that  the  probability  of  any  customer’s  demand  exceeding  vehicle  capacity  is 
infinitesimal. 

In  the  remainder  of  this  section  we  study  properties  of  the  optimal  solution  to  our  prob¬ 
lem.  First,  we  state  the  following  lemma  and  definition. 

LEMMA  1:  2 d0n  -I-  F„_|(a  +  b)  >  Vn-](b),  where  a  and  b  are  positive  integers  and 
a  +  b  <  c. 

PROOF:  After  supplying  customer  n  and  with  b  units  of  supply  remaining  in  the  vehicle, 
the  driver  may  decide  to  return  to  the  depot,  pick  up  an  additional  a  units  of  supply  and  return 
to  customer  n.  Since  this  is  not  necessarily  optimal,  the  result  follows. 

integer y (0  <  y  <  c)  |rf„+i.o  +  do.n  +  Z  p^x»  =  k)VK-\ 

k- 0 

(c- k)+ >  dn+ln+tlP(X„  •  k)Vn_x{y- k)  +  £  m„-  k)[2d0n+  *  +  >-)+] 

*-0  k-.v+l 

or  c  +  1  if  no  such  y  exists. 


We  can  now  state  and  prove  the  general  structure  of  the  optimal  policy. 

THEOREM  1:  After  supplying  customer  n  +  1,  if  the  remaining  supply  y,,  ^  yn,  then  the 
optimal  policy  is  to  proceed  to  the  next  demand  point.  Otherwise  return  to  the  depot. 

PROOF:  We  prove  the  first  part  of  the  theorem  here;  the  second  part  is  proved  in  an 
analogous  fashion.  Suppose  yn  <  c.  Then,  by  definition 

<Wo  +  d0,n  +  £  P(Xn  -  k)  V„_x(c  -  kV  >  dn+i,n  +  £  P(Xn  =  k)  -  k) 

*-o  *-o 


DEFINITION:  yn  =  min 


m  +  £  P(X„  -  kH2d0„  +  Vn.x(c  -  k+ynV]. 

Let  y„  -<  y„  4-  m,  where  m  is  a  positive  integer.  For  the  decision  of  proceeding  to  the  next  cus¬ 
tomer,  the  expected  cost  of  completing  the  route  is 

_  _ \ 
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y„+m 

+  I  m„ 


*-0 


k)V„-\ +  rn  -  k) 


+  X  P(x„  -  k)[2  d0n  +  yn-{(.c  -  k  +  yn  +  m)+]. 

k—yn+m+\ 

Subtracting  (3)  from  the  right-hand  side  of  (2)  results  in 

£  m„  =  -  k)  -  Vn_x(yn  +  m-k )] 

*-o 

+  I  P(Xn  =  Ac ) [2  d0,n  +  F„_,(c  -  A:  4-  j)„)+  -  Vn_,(y„  +  m  -  k)] 

H,+l 

+  X  =  k)\Vn~x(c  —  k  +  y„)+—  y„-\(c  ~  k  +  y„  +  m)+]  >  0. 

Ik— 1 

The  last  inequality  is  obtained  from  Lemma  1  and  the  fact  that  for  all  n 

y„(a  +  m)  ^  y„(a),  where  a,m  >  0  and  a  +  m  <  c. 

If  yn  =  c  +  1.  then  the  triangle  inequality  is  violated  since  d„+\.„  >  </B+l>0  +  d0.„  and  the 
optimal  decision  is  to  return  to  the  central  depot. 

From  the  previous  theorem,  the  following  algorithm  becomes  apparent. 

OPERA  TING  STRA  TEGY  ALGORITHM: 

Step  0.  n  —  0.  F0(y)  =  d{  0(y  =  0, 1,  . . .  ,  c). 

Step  1.  n  —  n  +  1;  determine  Y„(y)(y  =  0, 1,  ....  c)  and  y„. 

Step  2.  If  n  <  N  -  1,  go  to  step  1;  otherwise  compute  and  stop. 

EXAMPLE. 

In  this  section,  we  illustrate  the  algorithm  by  presenting  data  and  results  for  a  small  sam¬ 
ple  problem  in  order  to  gain  more  insight  into  the  performance  of  this  simple  dynamic  pro¬ 
gramming  algorithm.  We  consider  a  vehicle  routing  problem  with  four  demand  points  and  the 
distance  matrix  shown  in  Figure  1.  Suppose  that  customer  demands  are  Poisson  distributed 
with  average  demands  of  =  30,  X2  =  10,  X3  =  30,  X4  =  25  and  that  vehicle  capacity  is  100 
units.  After  some  preliminary  analysis,  the  following  fixed  route  emerges: 
0— *4— ►3  —  2— *1— *0.  We  seek  to  determine  an  effective  operating  strategy.  In  this  case, 
the  dynamic  programming  recursion  yields 

Yi  —  33 

J>2  =  0 

J>3  =  38 

and  the  expected  distance  traveled  under  this  optimal  policy  is  30.59  units.  If  this  operating 
policy  were  not  followed  and  a  vehicle  returned  to  the  depot  only  after  it  discovered  that  custo¬ 
mer  demand  could  not  be  satisfied,  then  the  expected  distance  would  be  32.04  units.  Thus,  the 
operating  policy  results  in  a  4.53%  savings  in  expected  distance  traveled.  Furthermore,  we  note 
that  if  vehicle  capacity  were  infinite,  distance  traveled  would  be  29  units.  The  operating  policy 
is  straightforward  and  easy  to  calculate,  intuitively  appealing,  and  very  effective  in  minimizing 
distance  covered. 
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ABSTRACT 

The  existing  literature  concentrates  on  determining  sharp  upper  bounds  for 
EVPI  in  stochastic  programming  problems.  This  seems  to  be  a  problem 
without  an  application.  Lower  bounds,  which  we  view  as  having  an  important 
application,  are  only  the  incidental  subject  of  study  and  in  the  few  instances 
that  are  available  are  obtained  at  an  extremely  high  cost. 

In  order  to  suggest  a  rethinking  of  the  course  of  this  research,  we  analyze 
the  need  for  bounds  on  EVPI  in  the  context  of  its  significance  in  decision  prob¬ 
lems. 

1.  INTRODUCTION 

The  standard  definition  of  the  expected  value  of  perfect  information,  EVPI,  is  the  max¬ 
imum  amount  that  the  decision  maker  would  pay  for  perfect  prior  information  as  to  the  realiza¬ 
tion  of  the  random  variable  in  a  decision  problem.  This  definition  is  contained  in  much  of  the 
recent  literature  that  concentrates  on  determining  upper  bounds  for  EVPI  in  stochastic  pro¬ 
gramming  problems.  For  example,  Ziemba  and  Butterworth  (6,  pp.  365-66]  note  that  EVPI  is 
the  upper  bound  on  the  "amount  that  the  decision  maker  would  pay  a  clairvoyant"  to  provide 
the  information  but  go  on  to  suggest  that  since  the  calculations  are  formidable,  "it’s  of  interest 
to  find  art  upper  bound  on  EVPI  that  requires  less  computational  effort."  No  attention  is  paid 
to  developing  a  lower  bound  other  than  zero.  (See  also  ]1],  [2],  [4].)  That  upper  bounds  on 
EVPI  and  this  standard  definition  should  be  juxtaposed  is  peculiar.  There  is  little  managerial 
"value"  in  providing  upper  bounds  on  EVPI  within  the  context  of  its  definition.  In  contrast,  a 
lower  bound  on  EVPI,  which  has  managerial  significance,  is  typically  left  at  zero  by  authors 
writing  on  the  subject.  To  illustrate,  suppose  that  we  determined  an  upper  bound  on  EVPI  to 
be  55.  Armed  with  this  information,  how  much  will  we  pay  a  clairvoyant  to  predict  realizations 
of  the  random  variable?  Since  55  in  an  upper  bound ,  it  may  be  unjustifiable  to  offer  55.  To  be 
safe,  we  should  pay  close  attention  to  the  lower  bound.  And  it  seems  reasonable  to  offer  to  pay 
the  clairvoyant  only  up  to  the  lower  bound  for  this  information  since  it  is  the  maximum  amount 
that  we  could  pay  and  still  be  assured  of  justifying  the  expense. 

In  the  next  section,  we  discuss  a  stochastic  programming  problem  from  an  applications 
point  of  view. 
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2.  STOCHASTIC  PROGRAMMING  AND  EVPI 

Consider  a  decision  problem  given  by 

(1)  Max  { E,uifix,  z))|x  €  A } 

where  the  decision,  x ,  must  be  made  prior  to  the  realization  of  the  random  variable,  z.  In  (1), 
x  and  z  may  be  vectors  and  fix,  z)  is  the  accumulated  monetary  wealth  of  the  decision  maker 
subsequent  to  the  random  event  and  u  is  a  utility  function;  A  is  the  set  of  feasible  decision 
alternatives. 

Problem  (1)  can  be  very  difficult  computationally  since  it  may  involve  maximizing  an 
expected  value  of  a  nonlinear  function  of  both  x  and  the  random  variable  z.  One  way  to  ease 
this  computational  burden  is  to  find  and  solve  a  computationally  easier  problem  which  gives 
approximately  the  same  solution  as  (1).  Another  approach  would  be  to  expend  resources  to 
obtain  aforehand  knowledge  of  z,  thus  at  least  eliminating  the  expectation  from  the  problem. 
This  latter  approach  also  provides  an  opportunity  to  produce  a  greater  expected  utility. 

SOLVING  AN  EASIER  PROBLEM 

Suppose  we  solve  an  easier  problem  and  obtain  the  solution  xe.  Suppose  also  that  the  cost 
to  solve  (1)  is  C|.  Then  it  would  be  worthwhile  adopting  xe,  thus  avoiding  C,,  as  long  as 

(2)  Ezuifixe,  z))  >  Max  [Ezujix,  z)  -  C,)|x  €  A}. 

Now  if  an  easily  computed  upper  bound  on  the  RHS  of  (2),  call  it  Ue,  can  be  found,  then  a 
decision  rule  is  available.  It  is  to  adopt  xf  as  the  problem  solution  if  E.uifix,,  z))  >  Ue. 


To  see  how  this  might  work,  consider  an  example  where  u  is  linear  and  /  is  concave  in  z. 
Then  (2)  becomes 

(2a)  C |  >  Max  { EJix ,  z)\x  €  A)  -  E.fixe,  z). 

Let  the  easier  problem  be 

(3)  Max  [uifix,  M))|x  €  A}  =  //(Max  [fix,  M)|x  €  A}) 

where  n  =  Eiz).  Avriel  and  Williams  (1]  used  the  following  inequalities  for  determining  an 
upper  bound  on  EVPI: 


(4)  EJix,  z)  <  Max  [EJix,  z)\x  €  A)  <  £;Max{/(x,  z)|x  6  A|  <  Max  {/(x,  n)\x  6  A} 
where  x  solves  (3).  For  u  linear, 

(5)  EVPI  =  £,  Max  [fix,  z)\x  6  A)  -  Max  [EJix,  z)|x  €  A) 
and 

(6)  Max  [fix,  fi)\x  6  A}  -  EJix,  z)  >  E.  Max  |/(x,  z)|x  €  A) 

-  Max  [EJix,  z)|x  €  A)  =  EVPI. 

If  we  define  xe  =  x,  then  the  LHS  of  (6)  is  an  upper  bound  on  the  RHS  of  (2a)  and  x,  would 
be  adopted  if 


C|  >  fix,,  fi)  -  EJix,,  z). 


. .  . — 


(7) 
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The  RHS  of  (7)  is  relatively  easy  to  compute  since  it  involves  solving  a  deterministic  program¬ 
ming  problem  and  taking  an  expectation. 

That  the  RHS  of  (7)  is  an  upper  bound  on  EVPI  is  incidental.  The  essential  elements  are: 
finding  an  easier  problem  to  solve  and  finding  an  upper  bound  on  (1)  which  provides  the  deci¬ 
sion  rule  (7).  Research  directed  toward  finding  an  upper  bound  on  EVPI  such  as  found  in 
Avriel  and  Williams  [1],  Ziemba  and  Butterworth  16],  and  Huang,  Vertinsky,  and  Ziemba  [4]  is 
clearly  applicable  to  the  decision  problem  posed.  However,  if  we  were  able  to  discover  an 
easier  problem  for  which  an  upper  bound  on  the  RHS  of  (2a)  is  less  than  EVPI,  we  would  be 
exceedingly  happy. 

ACQUIRING  AFOREHAND  PERFECT  INFORMATION 

Assume  perfect  aforehand  information  can  be  acquired  for  a  cost  of  C/  <  Given  this 
information,  one  can  solve  the  wait-and-see  problem 

(8)  Max  r))|x  €  A) 

for  that  value  of  z  which  will  be  realized.  Solving  (8)  is  ordinarily  much  easier  than  solving 
(1).  Suppose  that  the  cost  to  do  so  is  Cw  <  C\.  Then  it  is  desirable  to  acquire  the  informa¬ 
tion  if 

(9)  Ez  Max  [uifix,  z)  -  Cw  -  C,)\x  6  K)  >  Max  { Ezuifix ,  z )  -  C|)|x  €  A}. 

We  define  an  EVPI'  implicity  in 

(10)  E,  Max  | uifix,  z)  -  Cw-  EVPI')|x  €  A)  -  Max  { Ezuifix ,  z)  -  C,)|x  €  A). 

Equation  (10)  is  in  contrast  with  the  implicit  definition  of  EVPI  usually  stated  as 

(11)  Ez  Max  {u(f(x,  z)  —  EVPDlx  €  A}  =  Max  ( Ezuifix ,  z))lx  €  A}. 

If  u  is  linear ,  then  (10)  yields 

(12)  EVPI'  =  Ez  Max  [fix,  z)\x  €  A)  -  Max  [EJix,  z)\x  €  A)  +  C,  -  Cw 
and  (11)  yields  the  definition  of  Equation  (5).  Therefore, 

(13)  EVPI'  -  EVPI  +  C,  -  Cw. 


Now  if  the  acquisition  cost  of  the  information  C/  is  less  than  EVPI',  then  it  is  worth 
obtaining,  i.e.,  if 

(14)  C/  4  EVPI  +  Ci  —  Cw- 

Since  Ci  and  Cw  are  assumed  known,  lower  bounds  on  EVPI  are  significant  to  the  decision 
problem.  Upper  bounds  are  not. 

One  exception  to  the  concentration  on  upper  bounds  appears  in  Huang,  Vertinsky,  and 
Ziemba  [4].  They  show  that  for  a  linear  utility  function,  "sharp"  upper  and  lower  bounds  can  be 
obtained  for  both  z.  =  Ez  Max  [fix,  z)[x  €  A)  and  z„  =  Max  { E.fix,  z)\x  €  A}.  Then  if 
Zp  <  zp  <  Zp  and  zk  <  z„  <  z^,  we  have 

Max  i0,  zpL-znu)  <  EVPI  <  zpu -  z\; 


(15) 
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since  EVPI  —  zp  -  z„,  and  the  problem  becomes  one  of  finding  sharp  bounds  zp,  zp,  zfr,  z{/. 
Considerable  computing  effort  renders  these  bounds  as  sharp  as  desired  by  the  decision  maker. 
Therefore,  we  could  reconstitute  the  decision  inequality  of  (14)  as 

(16)  C,  <  Max  [0,  z/  -  zH )  +  C,  -  Cw. 

When  u  is  strictly  increasing  and  concave,  Ziemba  and  Butterworth  [6,  p.  375]  show  that 

(17)  0  <  EVPI  <  f(x,  n)  -  u~x[Ezuifix,  z))\, 

where  «_l  is  the  inverse  function  of  u.  The  lower  bound  which  is  essential  for  decision  making 
is  left  at  zero. 

Also  for  the  case  of  a  concave  utility  function,  Huang,  Vertinsky  and  Ziemba  [4,  pp.  136- 
137]  show  that  with  additional  assumptions  about  the  probability  distribution  of  z,  one  can 
obtain  either  an  upper  or  a  lower  bound  for  EVPI,  but  not  both. 

Although  Huang,  Vertinsky,  and  Ziemba  make  some  inroads  into  determining  lower 
bounds  on  EVPI,  it  appears  that  such  a  determination  is  entirely  accidental  rather  than  by 
design.  They  appear  to  ignore  the  decision  maker's  real  problem  and  concentrate  on  the  upper 
bound  for  EVPI.  The  research  direction  in  this  area  would  benefit  from  returning  to  its  applica¬ 
tions  base  initiated  by  Dantzig  [3]  and  Madansky  [5]. 

3.  ADDITIONAL  COMMENTS  ON  LOWER  BOUNDS 

For  the  linear  utilit>  case,  by  Equation  (5)  zr  -  z„  defines  EVPI.  In  general,  z„  is  difficult 
to  calculate  as  has  been  the  assumption  in  motivating  discussion  of  the  decision  problem.  If 
the  domain  of  z  is  R  C  Es  (Euclidean  S-space),  the  evaluation  of  zp  involves  maximizing  a 
function  involving  multiple  integrals.  This  may  be  particularly  undesirable  and  resort  to  numer¬ 
ical  methods  may  be  required.  Likewise,  zp  would  also  involve  evaluating  a  multiple  integral 
and  again  resort  to  numerical  methods  may  be  required. 

It  is  instructive  to  return  to  the  paper  by  Huang,  Vertinsky  and  Ziemba  to  discuss  the 
nature  of  the  lower  bounds  on  EVPI  which  they  determine.  The  methods  suggested  consist  of 
approximating  the  integrals  imbedded  in  z„  and  zp  with  summations  by  partitioning.  For  exam¬ 
ple,  consider  R  =  \a,b ]  C  £’.  The  upper  bound  on  z„  would  be  found  by  evaluating 

08)  z“  =  Max  | £  a, fix,  /3,)|x  €  K 

where  a ,  is  the  probability  of  z  belonging  to  an  interval  i  and  /3,  is  the  mean  z  conditional  on  z 
belonging  to  interval  i  ]4,  pp.  130-131],  As  I  —  finer  partitions  are  made  and  z%  — *  z„. 
However,  (18)  is  merely  a  discrete  version  of  (1)  and  solving  it  approximates  the  solution  to 
(1).  Thus,  we  solve  a  problem  almost  as  computationally  undesirable  as  (1)  save  for  the  elimi¬ 
nation  of  an  integral.  « 

The  lower  bounds  on  zp  would  be  found  by  evaluating 

f  < o)  2p  =  X  8<  Max  (/(*•  4)U  €  K ) 

where  8,  is  an  appropriate  probability  measure  and  d,  is  an  appropriate  point  in  interval  i  [4,  pp. 
130-131].  Again,  as  finer  partitions  are  made,  zp~*zp.  But  calculating  z/  involves  solving 
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diireteTrS!Tinr8  Pr°b!fTS  (wh0SC  ValUCS  arC  then  averaged)  Equation  (19)  consists  of  a 
discrete  version  of  z  and  has  as  its  main  computational  virtue  the  elimination  of  an  integral 
Similar  results  hold  for  the  case  of  several  random  variables. 

Since 


<20>  4  ~  z"  <  EVPI  -zp-  z„, 

we  have  a  lower  bound  on  EVPI.  If  we  now  ask  the  question  of  how  much  we  would  be  willing 
to  pay  a  clairvoyant  for  the  benefits  of  perfect  aforehand  knowledge  of  z  ignoring  the  cost  of  cal 

^nnd  frnm'onf''  T"8  demen,S  C>  and  C*  as  factors  in  the  decision,  then  the  lower 

bound  from  (20)  would  help  to  answer  the  question. 

If,  however,  the  computation  costs  are  not  negligible  and  if  the  reason  for  seeking  a  clair¬ 
voyant  was  to  ease  the  computational  burden,  then  we  have  an  interesting  situation  It  is 

"l3Tyant  if  (14)  holds-  But  (,4)  is  Plicated  on  the  assumption 
h  t  C  ,  will  be  avoided  if  the  clairvoyant  is  hired.  But  in  finding  the  lower  bound  on  EVPI  we 

solved  a  prob|e,n  which  was  an  approximation  to  (1)  and  as  that  approximation  got  better,’ the 

mle  “  Z  d  aT°3,Ch  C,;i7hat  is’  Cl  was  not  entirelV  avoided.  Now  if  the  decision 

Hn  »h  ♦  .“r  t0  aV°ld  S°  Vlng  (  )’  we  have  a  dilemma  since  it  suggests  that  we  should  not 

do  what  we  just  (approximately)  did! 
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ABSTRACT 

This  nole  consists  of  developing  a  method  for  enforcing  additional 
constraints  to  linear  fractional  programs  and  showing  its  usefulness  in  solving 
integer  linear  fractional  programs. 


1.  INTRODUCTION 

Fractional  cutting  plane  methods  for  solving  integer  linear  fractional  programs  have  been 
proposed,  for  example,  by  Swarup  [61,  Grunspan  and  Thomas  [31  and  Granot  and  Granot  [21. 
In  contrast  with  these,  the  method  proposed  in  the  sequel  does  not  impose  any  severe  restric¬ 
tion  on  the  problem  as  in  Swarup  [6],  does  not  consist  of  solving  many  integer  programs  as  in 
Grunspan  and  Thomas  [3]  or  does  not  involve  complicated  computations  in  deriving  the  cuts  as 
in  Granot  and  Granot  [21 .  The  results  to  be  followed  consist  of  developing  a  method  for 
enforcing  additional  constraints  to  a  linear  fractional  program  and  then  using  it  for  solving  pure 
and  mixed  integer  linear  fractional  programs  by  cutting  plane  methods. 

2.  METHOD  OF  ENFORCING  ADDITIONAL  CONSTRAINTS 

Consider  the  following  linear  fractional  program: 

n 

I  CjXj  +  « 

(P):  maximize  — - 

£  dj*j  +  p 
j- 1 

n 

subject  to  £  auxJ  “  aio<  /  -  1,2 . m, 

j- 1 

Xj  >  0,  j  -  1,  2, ....  n, 

n 

where  it  is  assumed  that  £  djXj  +  0  >  0  for  all  feasible  solutions. 

7-1 

Suppose  that  a  basic  feasible  solution  to  ( P )  is  known  and  with  reference  to  this,  let  the 
constraints  be 
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(1)  XB  -  b,  +  £  y0(-Xj),  i  -  1,  2,  ....  m, 

jiN 

Xj  >  0,  y  -  1,  2,  ....  n, 

where  xB ,  i  —  1,2,...,  m  are  the  basic  variables,  N  is  the  index  set  of  nonbasic  variables  and 
the  basic  solution  is  obtained  by  putting  x}  —  0  for  every  j€N  in  (1).  Let  z1,  z2  be  respective¬ 
ly  £  cB,b<  +  “,  £  dB  b,  +  /3,  z/  -  £  cfl  y,7,  z/-  £  dBy0  and  A,  «  |z2(z,' -  Cy)  z'(z/ - 

i-l  /-I  i-l  i-l 

dj))/z 2  be  calculated  for  all  j  €N  .  Let  the  additional  constraint  to  be  appended  be 

(2)  £  PjXj  >  q. 

Substituting  for  xB  ,  /  -  1,2,  .  .  .  ,  m  from  (1)  in  (2)  let  (2)  take  the  form 

(3)  x„+i  -  bm+l  +  £  yn+l  (-Xj)  >  0. 

jiN 

Let  us  denote  by  (/*j)  the  new  problem  obtained  from  ( P )  by  appending  (3).  Then 

xB,“bi,  j  _  i,  2,  . . .  ,  m  +  1 

xj-  0,  j  €  N 

is  a  basic  solution  to  (P,)  where  we  designate  n  +  1  for  Bm+l.  Let  c„+,  and  d„+}  be  assigned 
the  values  zero. 

The  problem  of  interest  here  occurs  when  bm+i  <  0.  To  establish  the  validity  of  the  suc¬ 
cessive  steps  of  the  m'ethod  to  be  followed  in  reoptimizing  the  problem  (Z^),  we  require  the 
following  lemmas,  the  proofs  of  which  can  be  constructed  with  the  help  of  Hadley  [4], 

LEMMA  1.  If  bm+x  <  0  and  if  the  set  of  feasible  solutions  to  (/>])  is  nonempty,  then 
there  exists  a  j  €  Wsuch  that  ym+XJ  <  0. 

LEMMA  2.  For  every  j  €  N. ,  there  exists  an  i  €  {1,  2.  ....  m)  such  that  ytj  >  0. 


ALGORITHM  1. 


STEP  1.  Set  up  a  tableau  giving  xB,  b„  y,j ,  z}  -  c,,  zj  -  d,A,  for  /  -  1,  2 . m  and  j  €  N. 

Compute  z1,  z2  and  z.  Append  the  additional  constraint  in  the  form  (3). 
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STEP  2.  If  b„t[  <  0,  set  7-0  and  go  lo  slop  3.  Otherwise,  scl  7  —  1  utul  go  lo  step  8. 

STEP  J.  Set  M  -  {./I./  t  N.  v,,i  !  <  0|.  If  M  is  empty,  no  feasible  solution  exists,  stop. 

STEP  4.  Compute  — -  Min  1  — ,  v„  >  o|  for  every  ./  €  N.  Set  -  j./|>  €  Af. 

y.  UU  -*!'•’  ■«)  \y„  I  l 

ft ,,, ,  |  A , j  )  I 

V  If  Af)  is  empty  go  to  step  ft  Otherwise,  set  7  -  I,  r  -  m  +  I  and 

y«  1 1  j  y*{i) , i  j 

go  to  step  5. 


STEP  5.  Eind  k  such  that 


t.  r  -  Min 

'*.w, 

,* 1  (:;’  </,)  and  go  to  step  7, 


>i 


A, 


•wllil 


/«•!./  <  0}  where  /*(U  -  r«o.,  “ 


STEP  ft  Eind  k  such  that  »w(,  k  -  Min  vwtl  If  vw(,  *  >0  no  feasible  solution  exists,  stop. 

'  mV 


Otherwise,  Und  r  such  that 

N 


-  Min 

y»*  m|I.1.  "I 


— .  y*  >  0 
y,k 


STEP  7.  Do  a  simplex  pivoting  to  obtain  a  new  basic  solution  by  replacing  \N  by  a,  in  the  set 
of  basic  variables.  Modify  \ ,  calculate  A,.  c,,  -  </,  for  all  /  t  ,V  by  modifying 

their  dctlnitions  to  take  the  summation  from  I  to  m  +  I.  Also  calculate  und 
If  7-0  return  to  step  3. 


STEP  8  If  A,  >  0  for  all stop;  the  optimal  solution  is  \w  - 

b, 

x ,  —  0  tor  I  t  N.  Otherwise  let  A*  -  Min  A,,  — 

mV  3V* 

return  to  step  7. 


ft,  for  i  —  l.  2.  . 

f  ft 

Min  , 

M  |l  J  •«  t  III  ,V,» 


m  +  I .  and 
'  .  v,*  >  o|  and 


3.  FRACTIONAL  CUTTING  PLANE  METHOD  EUR 
INTEGER  I. IN  EAR  ER  ACTION  Al,  PROGRAMS 


As  an  application  of  Algorithm  I  we  present  in  this  section  a  culling  plane  method  for 
solving  integer  linear  fractional  programs  The  method  for  (mixed!  integer  linear  fractional 
programs  follows  closely  tlomory’s  fractional  cutting  plane  method  for  (mixed)  integer  linear 
programs  and  can  briefly  be  described  as  follows. 

Solve  the  problem  obtained  by  omitting  the  integer  restrictions  If  the  solution  satisfies 
the  integer  restrictions,  then  it  is  optimal;  otherwise,  introduce  a  Clomory'x  (mixed  integer) 
fractional  cut  and  rcoplimi/c  by  using  Algorithm  I  and  repeut  the  process 


REMARK  Enforcing  a  single  additional  constraint  to  a  linear  fractional  program  can  be 
done  easily  by  using  the  dual  simplex  method  after  employing  the  Charnos  and  Cooper  |l| 
transformation.  But  in  the  case  of  integer  linear  fractional  programs,  the  transformation  des¬ 
troys  the  Integer  nature  of  the  variables  and  Clomory's  fractional  cuts  in  their  usual  forms  can¬ 
not  be  used  However,  cuts  can  be  obtained  as  given  by  Clranot  and  tiranot  |2|,  but  it  can  be 
observed  that  such  derivations  of  cuts  involve  more  complicated  compulations  than  those  in 
our  case 
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Since  the  objective  function  is  not  integer  constrained,  the  proof  of  finiteness  of  the  above 
method  cannot  possibly  be  given  in  a  manner  similar  to  that  of  Gomory’s  methods. 
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